The Open University

MST224
Mathematical methods

Book 1

Differential equations

Cover image: This shows a simulation of the patterns formed by smoke particles moving in air, 
which is itself in turbulent motion. The positions of the particles are described by relatively simple 
differential equations, yet the patterns that they form are complex and intriguing. Similar patterns 
are also relevant to understanding how clouds produce rain, and are a subject of ongoing research at 
The Open University. 


This publication forms part of an Open University module. Details of this and other 

Open University modules can be obtained from the Student Registration and Enquiry Service, The 
Open University, PO Box 197, Milton Keynes MK7 6BJ, United Kingdom (tel. +44 (0)845 300 6090; 
email general-enquiries@open.ac.uk). 

Alternatively, you may visit the Open University website at www.open.ac.uk where you can learn 
more about the wide range of modules and packs offered at all levels by The Open University. 


To purchase a selection of Open University materials visit www.ouw.co.uk, or contact Open 
University Worldwide, Walton Hall, Milton Keynes MK7 6AA, United Kingdom for a brochure 
(tel. +44 (0)1908 858779; fax +44 (0)1908 858787; email ouw-customer-services@open.ac.uk). 


MIX: Paper from responsible sources. FSC® C016278


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published 2013. 
Copyright © 2013 The Open University 


All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, transmitted 
or utilised in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without 
written permission from the publisher or a licence from the Copyright Licensing Agency Ltd. Details of such 
licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Ltd, Saffron 
House, 6-10 Kirby Street, London EC1N 8TS (website www.cla.co.uk). 


Open University materials may also be made available in electronic formats for use by students of the 
University. All rights, including copyright and related rights and database rights, in electronic materials and 
their contents are owned by or licensed to The Open University, or otherwise used by The Open University as 
permitted by applicable law. 


In using electronic materials and their contents you agree that your use will be solely for the purposes of 
following an Open University course of study or otherwise as licensed by The Open University or its assigns. 


Except as permitted above you undertake not to copy, store in any medium (including electronic storage or 
use in a website), distribute, transmit or retransmit, broadcast, modify or show in public such electronic 
materials in whole or in part without the prior written consent of The Open University or in accordance with 
the Copyright, Designs and Patents Act 1988. 


Edited, designed and typeset by The Open University, using the Open University TX System. 
Printed in the United Kingdom by Halstan & Co. Ltd, Amersham, Bucks. 


ISBN 978 1 7800 7479 5 


Contents

Unit 1 Getting started
Introduction
1 Some elementary functions
1.1 Functions, variables and parameters
1.2 Linear functions
1.3 Quadratic functions
1.4 Powers
1.5 Combining functions
2 The exponential and logarithm functions
2.1 The exponential function
2.2 The natural logarithm
3 Trigonometric functions
3.1 Introducing the trigonometric functions
3.2 Inverse trigonometric functions
3.3 Some useful trigonometric identities
4 Complex numbers
4.1 The arithmetic of complex numbers
4.2 Polar form of complex numbers
4.3 Euler’s formula
5 Differentiation
5.1 Derivative as a rate of change
5.2 Differentiating combinations of functions
5.3 Investigating functions
6 Integration
6.1 Reversing differentiation
6.2 Evaluating integrals
6.3 Integration by substitution and by parts
6.4 Definite integrals
Learning outcomes
Solutions to exercises
Acknowledgements


Unit 2 First-order differential equations
Introduction
1 Some basics
1.1 Where do differential equations come from?
1.2 Differential equations and their solutions
2 Direct integration and separation of variables
2.1 Direct integration
2.2 Separation of variables
3 Solving linear differential equations
3.1 Linear differential equations
3.2 Linear constant-coefficient equations
3.3 The integrating factor method
4 Direction fields
5 Numerical solutions and Euler’s method
Learning outcomes
Solutions to exercises
Acknowledgements

Unit 3 Second-order differential equations
Introduction
1 Some preliminary remarks
1.1 Requirement for two arbitrary constants
1.2 Linearity and superposition
2 Homogeneous differential equations
2.1 Two simple cases
2.2 Solution in the general case
2.3 General procedure and further practice
2.4 Damped harmonic oscillators
3 Inhomogeneous differential equations
3.1 General method of solution
3.2 The method of undetermined coefficients
3.3 Exceptional cases
3.4 Combined cases
4 Initial conditions and boundary conditions
4.1 Initial-value problems
4.2 Boundary-value problems
5 Resonance
Learning outcomes
Solutions to exercises
Acknowledgements

Index


Unit 1 Getting started


Introduction


Welcome to MST224 Mathematical methods. This module is designed to 
teach you the mathematics needed to study the physical sciences and other 
subjects (such as economics) where mathematics is used to model 
phenomena in the real world. 


Everything that you will learn in this module has applications in a wide 
range of topics. Because the same mathematical techniques appear in 
different contexts, the mathematical knowledge required by a really good 
scientist need not be that extensive. This module will teach you most of 
the essential tools. If you go on to further study, you will find that while 
some additional mathematical ideas may be needed, most of the time you 
will be discovering the power of what you have learned in this module. 


Because it is easier to learn just one new thing at a time, this module will 
concentrate on teaching the mathematics. It will not attempt to teach new 
topics in physical sciences, but in some cases the best way to illustrate a 
topic is in the context of a real-world example. In cases where a little 
science is required, it will always be explained. The assessment materials 
will test only your knowledge and understanding of the mathematics. 


To illustrate the relevance of the mathematics to the real world, some 
additional material will be included in brown boxes such as the one below. 
We recommend that you read the material in these boxes, but do not get 
stuck or discouraged if you do not understand everything. All of the 
material in this style of boxed text is non-examinable, but it is hoped that 
you will enjoy reading about the applications and will want to learn more. 


Note that green boxes contain essential information that must be read and 
understood. 


The ‘unreasonable effectiveness of mathematics’ 


As an illustration of why studying the mathematics underlying 
physical sciences can be so rewarding, consider Figure 1. The 
Cartwheel galaxy collided with another galaxy — presumably the 
small blue object to the left of the image — roughly 200 million years 
before the image that we now see. 


Unleashing awesome energy, the collision sent a shock wave into the
sparse gas around it. Expanding at nearly 100 kilometres per second,
this ‘cosmic tsunami’ ploughed up a concentration of hydrogen gas
and dust, creating conditions favourable for the birth of new stars
around the galaxy’s rim. The new stars in the rim are massive and
extremely bright. Many have lived fast, died young and exploded,
leaving behind burnt-out cores that are $5 \times 10^{13}$ times denser than
iron. Others have formed black holes that swallow up surrounding
matter; as matter falls into a black hole, never to be seen again, it can
produce intense bursts of X-rays. All these processes of galaxy
collision, shock wave creation, star birth and death, and the creation
of X-rays are explained in terms of mathematical models covering
situations that are far from familiar everyday experience, but
nevertheless trusted by physicists and astronomers. It is a remarkable
testament to the power of mathematics, and our understanding of
physical laws, that this is possible.

Figure 1 The Cartwheel galaxy (a composite image in false colour)

Figure 2 Iron atoms forming a ‘quantum corral’ (an image in false colour
produced by a scanning tunnelling microscope)


The ring in the image of the Cartwheel galaxy is about
$1.4 \times 10^{21}$ metres in diameter. The image in Figure 2 reveals the
equally strange world of the very tiny. It shows something called a
quantum corral imaged by an immensely powerful microscope (a
so-called scanning tunnelling microscope). The lumps around the
edge of the corral are individual iron atoms standing proud of a flat
copper surface, and the diameter of the corral is about $10^{-8}$ metres.
Of most interest to scientists are the ripples inside the corral. These
show concentrations of electrons that form patterns similar to water
waves inside a bucket or the vibration patterns on the surface of a drum.
These analogies are close, though not exact. The interesting thing, 
from our point of view, is that water waves, drum vibrations and the 
concentrations of electrons in a quantum corral can all be understood 
using similar mathematical techniques applied to slightly different 
equations. In the case of the electrons, we are completely beyond the 
comfortable familiarity of everyday life, but the tools of mathematics 
again provide the keys to understanding. The calculations are 
demanding and use sophisticated physical principles, but most of the 
mathematics is built on what you will study in this module. 


It cannot always have been obvious that all this would be possible. 
The Nobel laureate Eugene Wigner has spoken about what he called 
‘the unreasonable effectiveness of mathematics in the Natural 
Sciences’. For example, he points out that Newton was able to verify 
his law of gravitation to within 4%, but the law turned out to be 
accurate to better than one part in a million: outrageous fortune or 
unerring instinct? He concludes that: 


The miracle of the appropriateness of the language of 
mathematics for the formulation of the laws of physics is a 
wonderful gift which we neither understand nor deserve. 


(Eugene Wigner, 1960) 


Study guide 


The module is divided into thirteen units, which you will study at a rate of 
roughly one per fortnight. Each unit will emphasise one major topic or 
technique. The units are grouped into four books, dealing with connected 
sets of topics. 


The first book is primarily about differential equations, but this first unit 
is an exception. It reviews material that you will need in order to make a 
good start on MST224. The material is broad-ranging, so this unit 
contains more pages than others, but it may take no longer to study. This 
is because most of its topics have been covered in previous modules. If you 
find some things familiar, you may not need to study everything in great 
detail. 


To help you to judge where to put your effort, most subsections begin with 
a diagnostic test. If you can answer the test questions correctly, it is 
probably safe to skim through some subsections, just checking that you 
remember the key ideas. If you choose not to study a subsection in detail, 
do check any new terms that are introduced, and make sure that you 
understand them. (New terms are set in bold type.) 


If you do not have time to study the whole unit, make sure that you are 
familiar with the basic properties of the exponential and trigonometric 
functions (Sections 2 and 3), and with techniques for differentiation and 
integration (Sections 5 and 6). These topics are required in almost every 
part of the module, and without a reasonable knowledge of them you will 
get stuck later on. 


The unit contains a number of ‘standard formulas’: for example, for the 
solution of a quadratic equation, for expanding sin(a + b) and cos(a + b),
and for the derivatives and integrals of standard functions. It will be 
helpful if you are able to remember such formulas, but not essential; they 
are all given in the module Handbook. You do need to be aware that these 
formulas exist, know when they are needed and how they are used, and be 
able to find them quickly in the Handbook. 


1 Some elementary functions 


Functions play a central role in mathematics. After a brief look at some 
general ideas about functions (in Subsection 1.1), this section reviews some 
very simple but nevertheless important types of function: linear functions, 
quadratic functions and powers. It also looks briefly at how functions may 
be combined by composition. 


We sometimes say that a function ‘maps’ each input value to its
corresponding output value.


1.1 Functions, variables and parameters 


Diagnostic test 


Try Exercise 1. If your answer agrees with the solution, you may 
proceed quickly to Subsection 1.2. If not, then it is advisable to read 
this subsection more thoroughly. 


Consider the following example of defining a function. At midday on
1 June, a reservoir contains $2 \times 10^6$ cubic metres of water. For the next
50 days, the reservoir loses 15 cubic metres per minute. Suppose that at a
time t minutes after midday on 1 June, the reservoir contains V cubic
metres of water. Then we might use the equation

$$V = 2 \times 10^6 - 15t \quad (0 \le t \le 72\,000) \tag{1}$$

to model the volume of water in the reservoir for the 50 days
(= 72 000 minutes) after midday on 1 June. The letters V and t represent
measurable quantities. V and t are called variables. Here, V depends
on t, so V is called the dependent variable and t the independent
variable. The comment in parentheses simply says that t lies between 0
and 72 000.


A function is a process or rule that can be applied to each of a specified 
set of input values to produce a definite output value. One example is: 
‘given t between 0 and 72 000, calculate $2 \times 10^6 - 15t$’. If we denote this
function by f, then we can write equation (1) as V = f(t), where f(t) is 
the result of applying the rule f to the input value t. 


The domain of a function is the set of permitted input values. The
function f associated with equation (1) has as domain the set of real
numbers t with $0 \le t \le 72\,000$, that is, the interval [0, 72 000]. The image
set of a function is the set of output values. The function f associated
with equation (1) has as image set the set of values of $2 \times 10^6 - 15t$ with
$0 \le t \le 72\,000$, that is, the interval [920 000, 2 000 000]. This is the range
of volumes of water (measured in cubic metres) in the reservoir over the
50 days following midday on 1 June.


To define a function fully, we must give both the function rule and the 
domain. So the function corresponding to equation (1) is 


$$f(t) = 2 \times 10^6 - 15t \quad (0 \le t \le 72\,000), \tag{2}$$

where the expression $2 \times 10^6 - 15t$ on the right-hand side gives the rule or
formula that specifies the function, and the conditions in parentheses
indicate the domain (the range of input values for which the function is
valid).
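
The idea that a function is a rule together with a domain can be mirrored in a short program. The following Python sketch is only an illustration: the names `volume` and `T_MAX` are our own choices, not notation used in the module. It implements the rule of equation (2), rejects inputs outside the domain, and evaluates the rule at the two ends of the domain to recover the image set.

```python
T_MAX = 72_000  # 50 days expressed in minutes (illustrative name)


def volume(t):
    """Rule of equation (2): volume in cubic metres, t minutes after midday."""
    if not 0 <= t <= T_MAX:
        raise ValueError(f"t = {t} lies outside the domain [0, {T_MAX}]")
    return 2 * 10**6 - 15 * t


# The rule is decreasing in t, so the image set runs from volume(T_MAX)
# up to volume(0).
image_set = (volume(T_MAX), volume(0))
print(image_set)  # (920000, 2000000)
```

Raising an error for inputs outside the domain is one possible design choice; it makes explicit that the rule alone does not define the function.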


To define a function, a process must produce a unique output value for
each allowed input. So, for example,

$$f(x) = \pm\sqrt{x} \quad (x > 0)$$

does not define a function f because $\pm\sqrt{x}$ does not specify a unique value
for a given x. (We write $\pm\sqrt{x}$ to denote the positive and negative square
roots of x because, by convention, $\sqrt{x}$ denotes only the positive square
root.)


You may have noticed a subtle difference between equations (1) and (2). 
The former gives a relationship between the variables V and t, but the 
latter defines a function — the abstract rule underlying this relationship. In 
this module, however, functions will often be labelled in a way that blurs 
this distinction. For example, we may write V = V(t) rather than 
introducing a separate symbol, such as f, for the function relating V to t.
In this notation, V(30) is the value of V when t = 30. This is the volume
of water in the reservoir 30 minutes after midday on 1 June. But V(t) is
generally taken to represent the function itself. If there is any doubt, the 
context will make it clear whether we are referring to a function or just one 
of its values. 


We can also consider a generalisation of the reservoir model. Suppose that
the reservoir initially contains $V_0$ cubic metres of water and that the rate
of loss is L cubic metres per minute. We will assume that this situation
persists for 72 000 minutes after a starting time t = 0. Then we have

$$V = V(t) = V_0 - Lt \tag{3}$$

for $0 \le t \le 72\,000$. We now have an equation involving several symbols,
with differing roles. Assuming that we want to use equation (3) to describe
how V changes with time t, we continue to call t the independent variable
and V the dependent variable. The function V(t) tells us how to relate a
time t to V. The quantities $V_0$ and L do not depend on t. They may,
however, take different values in different uses of equation (3), in an
application to a different reservoir, for example. $V_0$ and L are called
parameters. Whatever the values of the parameters $V_0$ and L,
equation (3) gives a similar form of relationship between V and t: for
example, the independent variable t appears in a similar way in any of the
expressions 12 000 − 5t, 300 − 6.6t and 14 − 2t.
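
The different roles of parameters and variables can be seen in a small program. In the Python sketch below, the helper `make_volume` and the names `reservoir` and `small_tank` are hypothetical, chosen only for this illustration. The parameters are fixed once, when a particular function V(t) is created; the independent variable t is supplied each time that function is evaluated.

```python
def make_volume(V0, L, t_max=72_000):
    """Return the function V(t) = V0 - L*t for fixed parameters V0 and L."""
    def V(t):
        if not 0 <= t <= t_max:
            raise ValueError("t lies outside the domain")
        return V0 - L * t
    return V


reservoir = make_volume(V0=2 * 10**6, L=15)   # the reservoir of equation (1)
small_tank = make_volume(V0=12_000, L=5)      # same form, other parameters

print(reservoir(30))    # 1999550
print(small_tank(30))   # 11850
```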


Often, having defined a function that maps x to y (say), it is useful to
define another function that reverses this operation, so that if y = f(x), we
might wish to define another function g such that x = g(y). In this case g
is called the inverse function of f. As an example, if $f(x) = x^2$ with
domain x > 0, then solving y = f(x) for x gives the inverse function
$x = g(y) = \sqrt{y}$. (We ignore the negative solution $-\sqrt{y}$ because we know
x > 0.) By definition, the domain of the inverse function g(y) is
the image set of f(x), and the image set of g(y) is the domain of f(x).
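
A quick numerical check can make the idea of an inverse function concrete. The Python sketch below (the function names `f` and `g` follow the text; the test values are arbitrary) applies f and then g to a few inputs from the domain x > 0 and confirms that the original input is recovered.

```python
import math


def f(x):
    """f(x) = x**2 with domain x > 0."""
    if x <= 0:
        raise ValueError("x must be positive")
    return x**2


def g(y):
    """Inverse of f: the positive square root, with domain y > 0."""
    if y <= 0:
        raise ValueError("y must be positive")
    return math.sqrt(y)


for x in (0.5, 1.0, 3.0):
    # Applying f and then g should return the starting value.
    assert math.isclose(g(f(x)), x)

print(g(f(3.0)))  # 3.0
```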


The notation $\mathrm{m^3\,s^{-1}}$ is shorthand for ‘metres cubed per second’.


Exercise 1 
Consider the function 
$$f(x) = 2x^2 + 6 \quad (x > 0).$$
(a) State the domain and image set of f. 


(b) Find the inverse function, and state its domain and image set. 


A note on units and dimensions 


If you have taken a science course previously, you will be aware that 
most physical quantities have units or dimensions associated with 
them. This module will often use SI units: the SI units of distance 
and time are metres and seconds (denoted by m and s, respectively), 
and the SI units of volume are cubic metres (denoted by $\mathrm{m^3}$). In
equations where symbols represent physical quantities, there are two 
acceptable ways of combining units and symbols; both are correct, 
but they must not be mixed together. 


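Method A: The units are incorporated into the symbols


For example, the volume of the reservoir is written as $V = 10^6\,\mathrm{m^3}$,
where m denotes metres. In this convention, the symbol V has units
of $\mathrm{m^3}$.

Then, if we use an equation like (3), with initial volume $V_0 = 10^6\,\mathrm{m^3}$,
rate of water loss $L = 10\,\mathrm{m^3\,s^{-1}}$ and time $t = 1000\,\mathrm{s}$, we write

$$\begin{aligned}
V &= 10^6\,\mathrm{m^3} - (10\,\mathrm{m^3\,s^{-1}} \times 1000\,\mathrm{s}) \\
  &= 10^6\,\mathrm{m^3} - 10\,000\,\mathrm{m^3} \\
  &= 990\,000\,\mathrm{m^3}.
\end{aligned}$$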


Method B: The symbols are pure numbers


For example, the volume of the reservoir is written as V cubic metres,
where $V = 10^6$ is a number with no units. In this convention, the
units are stated in the text, but the equations involve pure numbers.


In this approach, if we use an equation like (3) with initial volume
$V_0 = 10^6$ (in cubic metres), rate of water loss L = 10 (in cubic metres
per second) and time t = 1000 (in seconds), then we write

$$V = 10^6 - (10 \times 1000) = 990\,000.$$


It is then helpful to round off the calculation with a sentence that 
includes appropriate units, for example stating that the volume in the 
reservoir at time 1000 seconds is 990000 cubic metres. 


This module generally adopts the latter approach, so units will not 
appear in equations. If units are required in the final answer, the 
wording of the question will guide you as to what they should be. 


1.2 Linear functions 


Diagnostic test 


Try Exercises 2(a) and 3. If your answers agree with the solutions, 
you may proceed quickly to Subsection 1.3. 


A linear function relating y to x is one of the general form

$$y = y(x) = mx + c,$$

where m and c are constants. The graph of such a function is a straight
line (hence the term ‘linear’), as in Figure 3. The constant c represents the
value of y at the point where the line crosses the y-axis. The gradient (or
slope) of the graph is the same everywhere, and is equal to m. That is, for
any two points $(x_1, y_1)$ and $(x_2, y_2)$ on the graph, we have

$$m = \frac{y_2 - y_1}{x_2 - x_1}.$$

Figure 3 The graph of the linear function y(x) = mx + c
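
The two-point formula for the gradient translates directly into code. The Python sketch below is an illustration under our own naming (the function `line_through` and the sample points are not from the module): it finds m from two points on a line and then uses one of the points to find c. Exact fractions are used so that no rounding error appears.

```python
from fractions import Fraction


def line_through(p1, p2):
    """Return (m, c) for the line y = m*x + c through two points."""
    (x1, y1), (x2, y2) = p1, p2
    if x1 == x2:
        raise ValueError("a vertical line cannot be written as y = m*x + c")
    m = Fraction(y2 - y1, x2 - x1)   # gradient from the two-point formula
    c = y1 - m * x1                  # value of y where the line meets the y-axis
    return m, c


m, c = line_through((1, 5), (3, 9))
print(m, c)  # 2 3
```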


One situation where linear functions arise is when an object moves in a 
straight line with constant speed. Let us look at an example. 


At 11.00 pm, a smuggler’s boat passes a detector buoy that is 2 kilometres
from a port. The boat then moves at a steady 5 metres per second on a
straight course directly away from the port (along the line AB in
Figure 4). A coastguard vessel leaves the port in pursuit at midnight,
travelling at 7 metres per second. When will it catch the smuggler’s boat?


Figure 4 A boat chase modelled with linear functions


In this module, if no domain is specified for a function, and there are no
obvious points where it is not defined, you can assume that the domain
is ℝ, i.e. the set of all real numbers.

Equations (4) and (5) below are linear, since they can be rewritten in the
form y = mx + c.


Suppose that we choose to measure time in seconds, starting from
midnight, and distance in metres, measured from the port A. Let
X metres be the distance of the coastguard vessel from A at time t seconds
after midnight, and let Y metres be the distance of the smuggler’s boat
from A at the same time. We can readily obtain an expression for X in
terms of t, since X = 0 when t = 0, and the coastguard vessel travels at a
constant speed of 7 metres per second: we have


X = 7t.


We also want an expression giving Y in terms of t. The boat is moving at 
a constant speed of 5 metres per second, so Y will be related to t by a 
linear equation of the form 


Y =5t+c, 


where c is a constant. We also know that (at point B) Y = 2000 at 
11.00 pm, which is 1 hour, or 3600 seconds, before midnight and so 
corresponds to t = —3600. 


Exercise 2 


(a) Find the value of c such that Y = 5t + c satisfies the condition 
Y = 2000 at t = —3600. 


(b) (i) When will the coastguard vessel catch the smuggler’s boat? 


(ii) The limit of territorial waters is 100 kilometres from A. Will the 
vessel catch the boat within territorial waters? 


Simultaneous linear equations 


Surveying equipment often uses laser beams, which travel in straight lines 
and can therefore be described by linear equations. We might wish to 
locate a point in space by finding the position where two laser beams cross. 
This is one of a wide variety of situations where we need to find the 
intersection of two straight-line graphs, which is equivalent to the algebraic 
problem of solving two simultaneous linear equations. Consider, for 
example, the following linear equations (graphed in Figure 5): 


$$4x + 3y = -1, \tag{4}$$
$$3x + y = 3. \tag{5}$$

There are many ways of solving these equations: one powerful method is
Gaussian elimination. Let us see how this works for equations (4)
and (5).


Figure 5 Graphical representation of the equations 4x + 3y = −1 and
3x + y = 3, and their solution


The aim of the method is to subtract a multiple of the first equation from
the second in order to eliminate the x terms. First, we multiply
equation (4) by $\tfrac34$, to obtain an equation with the same coefficient of x as
in equation (5):

$$\tfrac34 \times 4x + \tfrac34 \times 3y = \tfrac34 \times (-1),$$

which simplifies to

$$3x + \tfrac94 y = -\tfrac34. \tag{6}$$

Now we subtract equation (6) from equation (5). This eliminates x, and
gives

$$(3x + y) - \left(3x + \tfrac94 y\right) = 3 - \left(-\tfrac34\right),$$

that is, $-\tfrac54 y = \tfrac{15}{4}$, so y = −3.

To find x, we substitute this value of y into equation (4), to obtain

$$4x + 3(-3) = -1,$$

which gives 4x = −1 + 9 = 8, and hence x = 2.

So the solution of equations (4) and (5) is x = 2, y = −3.
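
The elimination steps used for equations (4) and (5) can be written as a short routine. The Python sketch below is a minimal illustration for two equations in two unknowns; the function name `solve_2x2` and the tuple layout (a, b, r) for the equation ax + by = r are our own assumptions. It uses exact fractions, so the multiplier and the intermediate coefficients are computed without rounding.

```python
from fractions import Fraction


def solve_2x2(eq1, eq2):
    """Solve a1*x + b1*y = r1 and a2*x + b2*y = r2 by elimination.

    Each equation is a tuple (a, b, r). Assumes a1 != 0 and a unique solution.
    """
    a1, b1, r1 = (Fraction(v) for v in eq1)
    a2, b2, r2 = (Fraction(v) for v in eq2)
    k = a2 / a1                      # multiple of eq1 whose x coefficient is a2
    # Subtract k * eq1 from eq2; the x terms cancel.
    b_new = b2 - k * b1
    r_new = r2 - k * r1
    if b_new == 0:
        raise ValueError("no unique solution")
    y = r_new / b_new
    x = (r1 - b1 * y) / a1           # back-substitute into eq1
    return x, y


x, y = solve_2x2((4, 3, -1), (3, 1, 3))
print(x, y)  # 2 -3
```

Substituting the result back into both equations, as suggested for hand calculations, is an equally good check on the program.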


Exercise 3 

Use Gaussian elimination to solve the following equations for u and v: 
2u — 5v = 19, 
3u + 4v = —29. 


You may like to check the solution x = 2, y = −3 of equations (4) and (5)
by substituting these values into both equations.


1.3 Quadratic functions


Diagnostic test


Try Exercises 4(a) and 5. If your answers agree with the solutions,
you may proceed quickly to Subsection 1.4.


A quadratic function relating y to x is a function of the general form

$$y = y(x) = ax^2 + bx + c, \tag{7}$$

where a, b and c are constants, and a ≠ 0. The graph of such a quadratic
function is a parabola, and may open ‘up’ or ‘down’, depending on the
sign of a, as illustrated in Figure 6.


Figure 6 Graphical representation of two quadratic functions:
$y = 2x^2 + 3x - 4$ (opening ‘up’) and $y = -x^2 + 4x + 15$ (opening ‘down’)


If a > 0, then the graph opens ‘up’ and positive values of y may become
arbitrarily large, but there is a smallest (minimum) value that y can take.
If a < 0, then the graph opens ‘down’, and negative values of y may
become arbitrarily large in magnitude, but there is a largest (maximum)
value that y can take.


For example, suppose that a ball is thrown directly upwards at time
t = 0 seconds, with initial velocity $10\,\mathrm{m\,s^{-1}}$, from a height of 2.0 metres
(see Figure 7).


Figure 7 A ball is thrown vertically upwards


The ball moves under the influence of gravity. Its position, y metres above
the ground after t seconds, is given by

$$y = -4.9t^2 + 10t + 2.$$

(This equation may be taken on trust.)


Suppose that we want to find when the ball will hit the ground — that is, 
the value of t when y = 0. Then we need to solve the quadratic equation 


$$-4.9t^2 + 10t + 2 = 0. \tag{8}$$


You will have met the formula for the solution of a general quadratic 
equation before. It is given below. 


Procedure 1 Solution of a quadratic equation

The quadratic equation

$$ax^2 + bx + c = 0,$$

where a, b and c are constants, and a ≠ 0, can be solved for x using
the formula

$$x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}. \tag{9}$$

The solutions of a quadratic equation are often referred to as its
roots. (The term ‘root’ is also used for a solution of other sorts of
equation, as you will see in Section 4.)

Notice that the sum of the roots is −b/a, which is a useful check.


Using this formula to solve equation (8) for t gives

$$t = \frac{-10 \pm \sqrt{10^2 - 4 \times (-4.9) \times 2}}{2 \times (-4.9)} = 2.2 \text{ or } -0.18$$


(to two significant figures). Here the solution t = —0.18 refers to a time 
before the ball is thrown, so it can be discarded. The ball hits the ground 
about 2.2 seconds after it is thrown. 
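
The calculation for the ball can be reproduced with a few lines of code. The Python sketch below (the function name `quadratic_roots` is our own) applies formula (9) to equation (8) and then uses the sum of the roots, −b/a, as the check suggested in Procedure 1. It assumes that the discriminant is not negative.

```python
import math


def quadratic_roots(a, b, c):
    """Real roots of a*x**2 + b*x + c = 0 from the quadratic formula.

    Assumes a != 0 and a non-negative discriminant.
    """
    root = math.sqrt(b**2 - 4 * a * c)
    return (-b + root) / (2 * a), (-b - root) / (2 * a)


t1, t2 = quadratic_roots(-4.9, 10, 2)
print(round(t1, 2), round(t2, 2))  # -0.18 2.22

# Check: the sum of the roots should equal -b/a.
assert math.isclose(t1 + t2, 10 / 4.9)
```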


In this example, the quadratic equation has two solutions, but this is not 
always true. Look at the graphs in Figure 6, and imagine moving them up 
and down (which corresponds to varying the value of c). It is clear that the 
x-axis may meet a quadratic graph in two places, or it may touch at just
one place (where the graph has a maximum or minimum value), or it may 
not meet the graph at all. Examples of these three cases are shown in 
Figure 8. 


Figure 8 Graphs of the quadratic equation $y = ax^2 + bx + c$ for a > 0
and different values of $b^2 - 4ac$ (three panels: $b^2 - 4ac > 0$,
$b^2 - 4ac = 0$ and $b^2 - 4ac < 0$)


The three cases can be understood by inspecting equation (9), which
involves the quantity $b^2 - 4ac$.


• If $b^2 - 4ac > 0$, we obtain a positive value for $\sqrt{b^2 - 4ac}$, leading to
two distinct solutions.

• If $b^2 - 4ac = 0$, it follows that $\sqrt{b^2 - 4ac} = 0$, leading to just one
solution. (Although, as you will see below, this one solution is
sometimes regarded as two solutions with equal values.)

• If $b^2 - 4ac < 0$, there are no real values for $\sqrt{b^2 - 4ac}$, leading to no
real solutions.


The quantity $b^2 - 4ac$ is often referred to as the discriminant of the
quadratic equation because it discriminates between these three cases.

In Section 4, you will see that complex numbers enable us to write down
square roots of negative numbers, and hence to produce (complex)
solutions of a quadratic equation even when $b^2 - 4ac < 0$.

The symbol ω, used in Exercise 5, is the Greek letter omega. The Greek
alphabet is given in the Handbook.

You may like to check the factorisation $x^2 + x - 6 = (x - 2)(x + 3)$, given
later in this subsection, by multiplying out (x − 2)(x + 3).
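
The three cases distinguished by the discriminant can be expressed as a simple branching rule. The Python sketch below is an illustration only (the function name `real_solutions` and the sample coefficients are our own choices). It returns two, one or no real solutions according to the sign of $b^2 - 4ac$.

```python
import math


def real_solutions(a, b, c):
    """Return the list of real solutions of a*x**2 + b*x + c = 0 (a != 0)."""
    disc = b**2 - 4 * a * c          # the discriminant
    if disc > 0:
        root = math.sqrt(disc)
        return [(-b - root) / (2 * a), (-b + root) / (2 * a)]
    if disc == 0:
        return [-b / (2 * a)]
    return []                        # disc < 0: no real solutions


print(real_solutions(1, -5, 6))   # [2.0, 3.0]
print(real_solutions(1, 2, 1))    # [-1.0]
print(real_solutions(1, 0, 1))    # []
```

The test `disc == 0` is reliable here because the coefficients are integers; with decimal coefficients, rounding error can make an exact comparison with zero unsafe.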


Exercise 4 
Solve the following equations for x. 
(a) $2x^2 + 7x - 4 = 0$    (b) $x^2 + x - 6 = 0$


Sometimes, you may find that you need to solve a quadratic equation 
where the coefficients are letters rather than numbers. 


Exercise 5 
If K and ω are constants with K > ω, show that the solutions (for x) of

$$x^2 + 2Kx + \omega^2 = 0$$

are $x = -K \pm \sqrt{K^2 - \omega^2}$.


The solutions of a quadratic equation correspond to a factorisation of the 
corresponding quadratic function. For example, $x^2 + x - 6 = 0$ has
solutions x = 2 and x = −3, and this corresponds to the factorisation

$$x^2 + x - 6 = (x - 2)(x + 3).$$


With experience, you may find that such factorisations provide a 
convenient way of solving some quadratic equations, but the formula in 
equation (9) provides a reliable method that can be used in all cases. 


One point of caution: if you want to factorise a quadratic function, you can
do this by first solving the equation (e.g. by using equation (9)), but you
need to be careful to match the coefficient of $x^2$ in the factorisation with
that in the original quadratic function. For example, $2x^2 + 7x - 4 = 0$ has
solutions $x = \tfrac12$ and x = −4, but to factorise $2x^2 + 7x - 4$ we write

$$2x^2 + 7x - 4 = 2\left(x - \tfrac12\right)(x + 4) = (2x - 1)(x + 4),$$

where the 2 is needed to ensure that the coefficients of $x^2$ are the same on
each side.


It is helpful to recognise some particular factorisations. Two useful ones are

$$x^2 + 2Ax + A^2 = (x + A)^2 \quad \text{and} \quad x^2 - 2Ax + A^2 = (x - A)^2.$$

So, for example, $x^2 - 6x + 9 = (x - 3)^2$. Such quadratics are called
perfect squares. Perfect squares correspond to quadratic equations in
which the discriminant $b^2 - 4ac$ is equal to zero. (You may like to check
this for yourself.) Thus equations in which the discriminant is zero can be
written in the form (x + A)(x + A) = 0 or (x − A)(x − A) = 0, and these
factorisations lead us sometimes to consider such equations as having two
equal roots (x = −A and x = −A, or x = A and x = A) rather than just one
root.


Another useful factorisation is

$$x^2 - A^2 = (x + A)(x - A).$$

So, for example, $x^2 - 16 = (x + 4)(x - 4)$. Such a quadratic is called a
difference of two squares.


You need to be particularly careful when solving a quadratic equation that 
involves the same letters as appear in the standard formula (9), but in a 
different way. 


Example 1

Solve for x the equation

$$abx^2 - (a + b)x + 1 = 0,$$

where a and b are non-zero constants.

Solution

You need to keep a cool head here, because the letters in equation (9) are
used in a different way in the given equation. In equation (9), we need

ab for a,  −(a + b) for b,  1 for c.

So we obtain the solutions

$$x = \frac{a + b \pm \sqrt{(a + b)^2 - 4ab}}{2ab}.$$

This expression gives the solutions, but it turns out to be possible to
express them in a much simpler form. We have $(a + b)^2 = a^2 + 2ab + b^2$, so
the discriminant can be written as

$$(a + b)^2 - 4ab = (a^2 + 2ab + b^2) - 4ab = a^2 - 2ab + b^2 = (a - b)^2.$$

So the solutions can be written in the alternative form

$$x = \frac{a + b \pm \sqrt{(a - b)^2}}{2ab} = \frac{a + b \pm (a - b)}{2ab}.$$

Now (a + b) + (a − b) = 2a and (a + b) − (a − b) = 2b, so the two solutions
are x = 1/a and x = 1/b.


Exercise 6 
Factorise the following expressions, where a > 0. 
(a) $x^2 - a$ (b) $2x^2 - 8a$ (c) $x^4 - 6x^2 + 9$


This solution also follows from
the factorisation
$(ax - 1)(bx - 1) = 0$.


In $a^n$, $a$ is called the base, and
$n$ may be referred to as the
power, the index or the
exponent.

Recall that the $n$th root of a
number $a$ is a number $b$ such
that $b^n = a$, and we write
$b = \sqrt[n]{a}$.

The negative square root of 5,
for example, would be written as
$-5^{1/2}$ or $-\sqrt{5}$.


1.4 Powers 


Diagnostic test 


Try Exercise 7. If your answer agrees with the solution, you may 
proceed quickly to Subsection 1.5. 


You will know that $10^5 = 10 \times 10 \times 10 \times 10 \times 10$. In general, $a^n$ means the
product of $n$ copies of $a$ (for any real number $a$ and any positive integer $n$).
In particular, $a^1 = a$.

For positive integers $m$ and $n$, we have the property

$$a^n a^m = a^{n+m}, \tag{10}$$

since each side is the product of $n + m$ copies of $a$. For example,

$$10^3 \times 10^2 = 10^5.$$

Consequently, if we multiply $m$ copies of $a^n$, we obtain

$$\underbrace{a^n \times a^n \times \cdots \times a^n}_{m \text{ times}} = a^{\overbrace{n + n + \cdots + n}^{m \text{ times}}},$$

that is,

$$(a^n)^m = a^{nm}. \tag{11}$$

For example, $(10^2)^3 = 10^6$.


The definition of $a^n$ can be extended to cases where $n$ is not a positive
integer by assuming that equations (10) and (11) hold more generally. For
$a \neq 0$, this assumption leads to the definition of $a^0$ as 1, and $a^{-n}$ as $1/a^n$;
and, for $a > 0$, to the definition of $a^{1/n}$ as the $n$th root of $a$, and $a^{m/n}$ as
the $n$th root of $a^m$. So, for example:

$$10^{-4} = 1/10^4 = 0.0001;$$

$$27^{1/3} = \sqrt[3]{27} = 3 \quad (\text{since } 3^3 = 27);$$

$$4^{-3/2} = \frac{1}{4^{3/2}} = \frac{1}{\sqrt{4^3}} = \frac{1}{\sqrt{64}} = \frac{1}{8}.$$

If $a$ is positive, it is conventional to take the fractional power of $a$ to be
positive. So, for example, $9^{1/2} = \sqrt{9}$ means 3 rather than $-3$. If $a$ is
negative, fractional powers of $a$ do not necessarily exist, at least not as
real numbers. (The square roots of negative numbers are discussed in
Section 4.)


The following properties of powers hold for all real numbers $a > 0$ and all
real exponents $x$ and $y$. These properties are not proved here, but we will
make use of them as necessary.

$$a^x > 0, \tag{12}$$

$$a^{-x} = \frac{1}{a^x}, \tag{13}$$

$$a^{x+y} = a^x \times a^y, \tag{14}$$

$$a^{x-y} = \frac{a^x}{a^y}, \tag{15}$$

$$(a^x)^y = a^{xy}. \tag{16}$$

We also note that $0^x = 0$ for all $x > 0$.


Finally, given positive numbers $a$ and $b$, we have the following rules for
products and quotients:

$$(ab)^x = a^x \times b^x \quad\text{and}\quad (a/b)^x = a^x / b^x.$$

For example, $15^x = 3^x \times 5^x$ and $(5/3)^4 = 5^4/3^4$.

These rules do not apply if $a$ or
$b$ is negative. For example,
$\sqrt{(-1) \times (-1)} \neq \sqrt{(-1)}\,\sqrt{(-1)}$.
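
The product and quotient rules, and the way the product rule breaks down for negative numbers, can be illustrated with a short Python sketch. The sample values `a`, `b` and `x` are arbitrary choices made for illustration, and the complex square root from the standard `cmath` module is used so that $\sqrt{-1}$ can be evaluated (complex numbers are discussed in Section 4).

```python
import cmath
import math

a, b, x = 3.0, 5.0, 1.7  # arbitrary positive bases and a real exponent

# For positive a and b the product and quotient rules hold (up to rounding).
product_ok = math.isclose((a * b) ** x, a ** x * b ** x)
quotient_ok = math.isclose((a / b) ** x, a ** x / b ** x)

# With negative numbers the square-root form of the product rule fails.
lhs = cmath.sqrt((-1) * (-1))          # square root of +1
rhs = cmath.sqrt(-1) * cmath.sqrt(-1)  # i times i

print(product_ok, quotient_ok)
print(lhs, rhs)
```

The two sides of the last comparison differ in sign, which is exactly the failure described for negative $a$ or $b$.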


Exercise 7 


Use the properties of indices to simplify each of the following, where a > 0 
and x is real. 


(a) aa® —(b) a3fa®—(c) (a®)8_— (a) (2-4) x 4 
(ec) 8/3 (f) 163/4 —(g) (4)? (h) (1624) 2 


1.5 Combining functions 


Diagnostic test 


Try Exercise 8. If your answer agrees with the solution, you may 
proceed quickly to Section 2. 


Consider the following, which leads to an example of the composition of 
two functions. A hiker travels a distance of x kilometres after walking for a 
time t hours along a path. As he travels, the path rises up a mountain, so 
that after walking x kilometres along the path, his height (in kilometres) 
is h. 


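We might express $x$ as a function of $t$ using a function $f$, so that $x = f(t)$.
As a practical example, the distance might be

$$x = f(t) = 2t - \tfrac{1}{4}t^2 \quad (\text{for } 0 \le t \le 2).$$

Similarly, the height can be expressed as a function of $x$ by another
function $g$, so that $h = g(x)$. For example, the height might be

$$h = g(x) = \tfrac{1}{2}x - \tfrac{1}{8}x^2 \quad (\text{for } 0 \le x \le 3).$$

Now, we might want to determine the hiker’s height $h$ as a function of
time $t$. With $h = \frac{1}{2}x - \frac{1}{8}x^2$ and $x = 2t - \frac{1}{4}t^2$, we have

$$\begin{aligned}
h &= \tfrac{1}{2}\left(2t - \tfrac{1}{4}t^2\right) - \tfrac{1}{8}\left(2t - \tfrac{1}{4}t^2\right)^2 \\
  &= \left(t - \tfrac{1}{8}t^2\right) - \tfrac{1}{8}\left(4t^2 - t^3 + \tfrac{1}{16}t^4\right) \\
  &= t - \tfrac{5}{8}t^2 + \tfrac{1}{8}t^3 - \tfrac{1}{128}t^4,
\end{aligned}$$

so we can write $h = H(t)$, where $H(t)$ is the function

$$H(t) = t - \tfrac{5}{8}t^2 + \tfrac{1}{8}t^3 - \tfrac{1}{128}t^4.$$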


It is a very common procedure to express a quantity that is a function of 
one variable (height as a function of position, in our example) in terms of 
another variable (in this case, time). There is a shorthand way of 
describing what is being done in mathematical language. The relation 
between the function H and the functions f and g is written 


$$H(t) = g(f(t)),$$


meaning that to obtain the function H we apply the function f to t, and 
then the function g to the result. We say that H is the composition of 
the function g with the function f. 
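
Composition translates directly into code. The sketch below is illustrative only: it takes the hiker example to be $f(t) = 2t - \frac{1}{4}t^2$ and $g(x) = \frac{1}{2}x - \frac{1}{8}x^2$, the helper name `compose` is hypothetical, and `H_expanded` is the expanded quartic obtained by multiplying out $g(f(t))$.

```python
def f(t):
    """Distance walked (km) after t hours, for 0 <= t <= 2."""
    return 2 * t - t ** 2 / 4


def g(x):
    """Height (km) after walking x km along the path, for 0 <= x <= 3."""
    return x / 2 - x ** 2 / 8


def compose(outer, inner):
    """Return the function t -> outer(inner(t)): apply inner first, then outer."""
    return lambda t: outer(inner(t))


H = compose(g, f)


def H_expanded(t):
    """The composite written out as a single polynomial in t."""
    return t - 5 * t ** 2 / 8 + t ** 3 / 8 - t ** 4 / 128


for t in (0.0, 0.5, 1.0, 2.0):
    print(t, H(t), H_expanded(t))
```

The two columns of heights agree (up to rounding), as they should if the expansion is correct. Swapping the arguments, `compose(f, g)`, would give a different function, which is why the order of composition matters.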


In the hiker example above, the function g(x) gives height as a function of 
position, while the function H(t) gives height as a function of time. These 
are different functions, and texts in pure mathematics are usually strict 
about this distinction, and use different symbols for these functions, just as 
we have done above. When mathematics is used in the physical sciences, 
however, another approach is frequently used: we write $h(x)$ to denote
height as a function of distance, and $h(t)$ to denote height as a function of
time. The symbol inside the brackets ($x$ or $t$) indicates which of the
functions $h(x) = \frac{1}{2}x - \frac{1}{8}x^2$ or $h(t) = t - \frac{5}{8}t^2 + \frac{1}{8}t^3 - \frac{1}{128}t^4$ is intended. If
this approach were not adopted, we would have to introduce a different 
symbol for a quantity every time we make a change of variable, which 
would make some physics texts impossible to read. The approach works 
well so long as you do not use h(x) and h(t) in the same equation. 


Later on, when doing calculus, you will see that it is useful to be able to 
recognise a complicated function as the composition of simpler ones. 
Remember that the composite function $g(f(x))$ tells us to ‘apply $f$ first,
then $g$’, while $f(g(x))$ tells us to ‘apply $g$ first, then $f$’. The ‘inner’
function is applied before the ‘outer’ function, and the order in which the 
functions are applied matters! 


Example 2 


Express the function

$$h(x) = \frac{1}{\left(\sqrt{1 + 2x^2}\right)^3} \tag{17}$$

as a composite of a quadratic function and a power function.

Solution

Note first that $1 + 2x^2$ is quadratic and that by writing $y = 1 + 2x^2$, the
right-hand side of equation (17) becomes $\dfrac{1}{(\sqrt{y})^3} = y^{-3/2}$ (a power).

So we can obtain $h(x)$ in two steps.

Step 1 Calculate $y = 1 + 2x^2$.

Step 2 Apply $\dfrac{1}{(\sqrt{y})^3} = y^{-3/2}$ to the result of Step 1.

So if $f(x) = 1 + 2x^2$ and $g(y) = y^{-3/2}$, then $h(x) = g(f(x))$.

Here, the domain of $g$ is $y > 0$,
but since $f(x) = 1 + 2x^2$ is
always greater than 0, there is
no problem.

Exercise 8 


(a) If $f(x) = x^2$ and $g(x) = 1/(x - 1)$, with $x > 1$, find the following.
(i) $f(g(x))$ (ii) $g(f(x))$

(b) Express $h(x) = \sin(1 + x^2)$ as a composition of three basic functions.


2 The exponential and logarithm 
functions 


Diagnostic test 


Try Exercise 11. If your answer agrees with the solution, you may
proceed quickly to Section 3.

This diagnostic test covers both
the subsections in this section.


2.1 The exponential function 


The exponential function plays a central role in mathematics. Here its 
definition is given, followed by two of its most significant properties, before 
we examine some of their consequences. 


The exponential function $\exp(x)$ can be defined by writing an infinite series

$$\exp(x) = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!} + \cdots, \tag{18}$$

where $n!$ is the factorial of $n$, that is,
$n! = n \times (n - 1) \times (n - 2) \times \cdots \times 3 \times 2 \times 1$.

You may have met other
definitions of $\exp(x)$. These can
be shown to be equivalent to
equation (18).

For example, $2! = 2$, $3! = 6$ and
$4! = 24$.


This definition involves adding an infinite number of terms together. In 
general, dealing with infinite series can be a source of difficulties; for 
example, there will be problems if the terms keep growing as you go to 
larger and larger values of n. There is no problem with the series for the 
exponential function: as you add more terms in equation (18), their sum 
eventually approaches a definite number exp(x), which depends on x. The 
infinite series in equation (18) is called the Taylor series for $\exp(x)$.
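
The way the partial sums settle down can be seen in a short Python sketch. It is a minimal illustration, not part of the module: the function name `exp_partial_sum` and the choice $x = 3$ are assumptions made here, and the library value `math.exp` is printed only for comparison.

```python
import math


def exp_partial_sum(x, n_terms):
    """Sum the first n_terms terms of 1 + x + x**2/2! + x**3/3! + ..."""
    total = 0.0
    term = 1.0  # the n = 0 term, x**0 / 0!
    for n in range(n_terms):
        total += term
        term *= x / (n + 1)  # turns x**n / n! into x**(n+1) / (n+1)!
    return total


x = 3.0
for n_terms in (5, 10, 20, 30):
    print(n_terms, exp_partial_sum(x, n_terms), math.exp(x))
```

For $x = 3$ the individual terms grow at first (3, 4.5, 4.5, ...) before the factorial in the denominator takes over, after which the partial sums approach a definite number.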

A graph of the exponential function is shown in Figure 9. 


Figure 9 Graph of the exponential function

Differentiation is discussed in
Section 5.

Note that $\dfrac{n}{n!} = \dfrac{1}{(n - 1)!}$.


Exercise 9 


With the help of a calculator, add together the first six terms in
equation (18) for $x = 1/2$. Repeat your calculation for $x = 1$ using seven
terms, and for $x = 2$ using eight terms. Hence obtain estimates for
$\exp(1/2)$, $\exp(1)$ and $\exp(2)$.


Now for the two important properties of the exponential function. The
first is that

$$\frac{d\exp(x)}{dx} = \exp(x). \tag{19}$$

That is, when you differentiate the exponential function, the result is just
the function you started from.

The second property is that

$$\exp(x + y) = \exp(x)\exp(y). \tag{20}$$


These two properties (equations (19) and (20)) touch on so many aspects 
of mathematics that their power will become clear only as you progress 
through the module. 


Equation (19) makes the exponential function a powerful tool for dealing
with differential equations, as you will see later in this book. In order to
see that this expression is correct, apply the rule for differentiating $x^n$ to
each term in the definition, equation (18). This gives

$$\begin{aligned}
\frac{d}{dx}\exp(x) &= 1 + \frac{2x}{2!} + \frac{3x^2}{3!} + \cdots + \frac{nx^{n-1}}{n!} + \cdots \\
&= 1 + x + \frac{x^2}{2!} + \cdots + \frac{x^{n-1}}{(n-1)!} + \cdots \\
&= \exp(x).
\end{aligned}$$



Equation (20) is more difficult to derive directly from the definition, but 
the following exercise checks that it is correct in particular cases. 


Exercise 10 


Use the numerical values of $\exp(1/2)$, $\exp(1)$ and $\exp(2)$ quoted in the
solution to Exercise 9 to provide a numerical check that
$\exp(1/2) \times \exp(1/2) = \exp(1)$ and $\exp(1) \times \exp(1) = \exp(2)$, as predicted
by equation (20).


An alternative notation 


The property $\exp(x + y) = \exp(x)\exp(y)$ is analogous to the property
noted earlier for powers:

$$a^{x+y} = a^x a^y \quad (\text{for } a > 0).$$

This suggests that it is sensible to regard $\exp(x)$ as the power of some
positive number. We write

$$\exp(x) = e^x, \tag{21}$$

where $e$ is known as Euler’s number. The value of $e$ is easily found by
putting $x = 1$ in equation (21). This gives

$$e = \exp(1) = 2.718\,281\,828\ldots,$$

using a numerical result from Exercise 9. We then have $\exp(2) = e^2$,
$\exp(3) = e^3$, and so on.

Writing the exponential function in the form $\exp(x) = e^x$, equation (20)
takes the form

$$e^{x+y} = e^x e^y$$

for any numbers $x$ and $y$, and this is a special case of the equation
$a^{x+y} = a^x a^y$ for powers. The exponential function shares other properties
with powers of positive numbers. In particular,

$$e^x > 0, \tag{22}$$

$$e^{-x} = \frac{1}{e^x}, \tag{23}$$

$$\frac{e^x}{e^y} = e^{x-y}, \tag{24}$$

$$(e^x)^y = e^{xy}. \tag{25}$$

The number e 


The number e is named after Leonhard Euler (Figure 10), who 
explored many properties of the exponential function, but the 
discovery of the number e is credited to Jacob Bernoulli (Figure 11), 
who used it to solve a problem involving the repayment of interest. 
(Euler and Bernoulli are pronounced as ‘oiler’ and ‘ber-noo-lee’.) 


This value is correct to only ten 
significant figures; in fact, e is an 
irrational number, with a 
never-ending and 
never-repeating decimal 
representation. 


Figure 10 Leonhard Euler 
(1707-1783) 


Figure 11 Jacob Bernoulli 
(1654-1705) 


When $x$ is a single symbol, we
may also write $\ln x$ rather than
$\ln(x)$.


2.2 The natural logarithm 


It is useful to have a function that does the opposite of the exponential
function. This function is called the natural logarithm function, and is
given the symbol $\ln(x)$. The natural logarithm is the inverse function of
the exponential function, so if $y = \exp(x)$, then $x = \ln(y)$.

Some texts give other notations for $\ln(x)$, such as $\log_e(x)$ and $\log(x)$. We
often refer to $\ln(x)$ as the logarithm function (although other less useful
functions are also logarithms!). ‘Take the logarithm of $x$’, for example,
generally means ‘apply the natural logarithm function to $x$, to give $\ln(x)$’.

Because the exponential of a real number is always positive (see Figure 9),
the natural logarithm function $\ln(x)$ is defined only for $x > 0$. Its domain
is the set of real positive numbers, $x > 0$. A graph of the natural logarithm
function is shown in Figure 12.
Figure 12 Graph of the natural logarithm function

The logarithm function obeys the important property

$$\ln(XY) = \ln X + \ln Y \tag{26}$$

for all $X > 0$ and $Y > 0$. To see why this is true, let $X = \exp(x)$ and
$Y = \exp(y)$. Then, substituting in the left-hand side of equation (26) and
using equation (20) gives

$$\ln(XY) = \ln(\exp(x)\exp(y)) = \ln(\exp(x + y)).$$

Because the logarithm and exponential are inverse functions of one
another, the right-hand side becomes simply $x + y$. Substituting
$x = \ln(X)$ and $y = \ln(Y)$, we obtain the fundamental property of the
logarithm function given in equation (26).
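
Equation (26) is easy to check numerically. The Python sketch below is illustrative only: the values of `X` and `Y` are arbitrary positive numbers chosen here, and `math.log` is Python's natural logarithm.

```python
import math

X, Y = 37.5, 0.84  # any two positive numbers will do

# Property (26): the logarithm of a product is the sum of the logarithms.
lhs = math.log(X * Y)
rhs = math.log(X) + math.log(Y)

# Because exp undoes ln, the product can be recovered from the sum of logarithms.
product_via_logs = math.exp(math.log(X) + math.log(Y))

print(lhs, rhs)
print(product_via_logs, X * Y)
```

Each pair of printed values agrees up to rounding error in the last few digits.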




Log tables and slide rules 


Before electronic calculators became inexpensive, equation (26) was 
used to multiply or divide numbers. One converted numbers to their 
logarithms using a book of tables (log tables), added them (which is 
easier than multiplying), and used another table to find the answer. 


An application of this idea is the slide rule, a mechanical calculator 
marked with logarithmic scales, where the addition step is performed 
by moving one scale past the other. Pilot training still requires 
knowledge of this device, partly because it still works when every 
electrical circuit has failed! 


Several properties of the exponential function were listed in 
equations (22)—(25). By taking logarithms on both sides of these equations, 
we obtain the following results for the natural logarithm function: 


$$\ln(1/u) = -\ln u, \tag{27}$$

$$\ln(u/v) = \ln u - \ln v, \tag{28}$$

$$\ln(u^v) = v \ln u. \tag{29}$$


Exercise 11 


Simplify each of the following (where a, b and x are positive, and x and y 
are real). 


(a) $\ln 7 + \ln 4 - \ln 14$ (b) $\ln a + 2\ln b - \ln(a^2 b)$
(c) In(e” x e¥) (dy gen (e) e?inz 


(f) $\exp(2\ln x + \ln(x + 1))$


We sometimes want to calculate the value of

$$y = a^x$$

where $a > 0$ and $x$ is not an integer or a simple fraction. For example, we
may need to evaluate $5^\pi$. The exponential and natural logarithm functions
allow us to do this. We write

$$a = \exp(\ln(a)) = e^{\ln a},$$

and then

$$a^x = \left(e^{\ln a}\right)^x = e^{x \ln a}, \tag{30}$$

which can be evaluated using the ln and exp functions.
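
Equation (30) can be turned into a few lines of Python. This is a sketch under stated assumptions: the function name `power` is hypothetical, and the built-in `**` operator and `math.sqrt` are used only as independent checks.

```python
import math


def power(a, x):
    """Evaluate a**x for a > 0 using equation (30): a**x = exp(x * ln a)."""
    if a <= 0:
        raise ValueError("the base a must be positive")
    return math.exp(x * math.log(a))


value = power(2.0, 0.5)
print(value, math.sqrt(2.0))
print(power(7.3, -1.25), 7.3 ** -1.25)
```

The restriction to $a > 0$ in the code mirrors the restriction in the text: $\ln a$ is defined only for positive $a$.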


Exercise 12 


Use equation (30) to determine the values of A = 5™ and B = 10~*!° to 
four significant figures. 


Common log tables (e.g.
Figure 13) were based on
another logarithm function,
$\log_{10}$, defined so that if $10^y = x$,
then $y = \log_{10} x$. This function
is rarely used in advanced
mathematics.

Figure 13 Part of the title
page of a 1616 translation of
John Napier’s A Description
of the Admirable Table of
Logarithmes.

This calculation of $a^x$ is built
into most modern calculators.


Differential equations are 
equations involving derivatives. 
They are the subject of the next 
two units. 


Figure 14 Trigonometric
ratios: $\sin\theta = a/h$,
$\cos\theta = b/h$, $\tan\theta = a/b$

Recall that $180^\circ = \pi$ radians.

Figure 15 The point $A$ has
coordinates $(\cos\theta, \sin\theta)$


Distinguish carefully between a function of the form $x^a$, and a function of
the form $a^x$, where $a$ is a constant. A function of the form $x^a$ is called a
power function. Examples include $x^2$, $x^{1/2}$ and $x^{3/2}$. By contrast, a
function of the form $a^x$, where $a > 0$, can be written as $a^x = e^{kx}$, where
$k = \ln a$. We say that this describes an exponential dependence,
although $e^{kx}$ is only the exponential function, $e^x$, if $k = 1$.


3 Trigonometric functions 


This section adds another class of functions to the ‘library’ developed in 
Sections 1 and 2. These are the trigonometric functions. They originate in 
the geometry of right-angled triangles, but in this module we are equally 
often concerned with their use in describing repetitive or oscillatory 
behaviour. In particular, they arise as solutions of certain differential 
equations. 


3.1 Introducing the trigonometric functions 


Diagnostic test 


Try Exercise 13. If your answer agrees with the solution, you may 
proceed quickly to Subsection 3.2. 


You will have met $\sin\theta = a/h$, $\cos\theta = b/h$ and $\tan\theta = a/b$ as ratios in a
right-angled triangle (see Figure 14). However, these definitions of the sine,
cosine and tangent functions work only for $0 < \theta < \pi/2$, where $\theta$ is in
radians.


Note that many of the formulas in this module are valid only in 
radians, and angles will almost always be expressed in radians. 


To define the sine and cosine functions for a general value of $\theta$, we can
use Figure 15, which shows a circle of radius 1. Imagine that the line $OA$
started along the $x$-axis, and was then rotated anticlockwise through an
angle $\theta$. Then the point $A$ has coordinates $(\cos\theta, \sin\theta)$. Here $\theta$ may have
any value, and this defines the cosine and sine functions for all values of $\theta$
(positive, zero or negative). A negative value of $\theta$ corresponds to a rotation
clockwise.

If we rotate through $2\pi$ radians (360°), then we go round a full circle. So
rotations of $\theta$ and $\theta + 2\pi$ leave $A$ in exactly the same place. This leads to
the repetitive nature of the graphs of sin and cos: we have

$$\sin(\theta + 2\pi) = \sin\theta \quad\text{and}\quad \cos(\theta + 2\pi) = \cos\theta, \quad\text{for any } \theta$$

(see Figure 16). Functions like this, which repeat their values every $2\pi$, are
said to be periodic, with period $2\pi$.


Figure 16 Graphs of the sine and cosine functions 


You should also note that the graph of the cosine function is the same as
that of the sine function, shifted to the left by $\pi/2$. We can express this
mathematically as

$$\cos\theta = \sin(\theta + \pi/2).$$

Other trigonometric functions can be defined in terms of sin and cos. You
will have met the tangent function $\tan\theta = \sin\theta/\cos\theta$. This is defined for
all real $\theta$ except where $\cos\theta = 0$ (i.e. at $\theta = \pm\pi/2$, $\pm 3\pi/2$, and so on). You
may also have met

$$\sec\theta = \frac{1}{\cos\theta}, \quad \operatorname{cosec}\theta = \frac{1}{\sin\theta} \quad\text{and}\quad \cot\theta = \frac{1}{\tan\theta} = \frac{\cos\theta}{\sin\theta}.$$

We need to restrict the domains of cosec and cot to exclude points where
$\sin\theta = 0$, and the domains of sec and tan to exclude points where $\cos\theta = 0$.


Exercise 13 
(a) Use Figure 15 to find the values of $\sin\theta$ and $\cos\theta$ for $\theta = 0$ and $\theta = \frac{\pi}{2}$.

(b) Find the values of $\tan\theta$, $\sec\theta$, $\operatorname{cosec}\theta$ and $\cot\theta$ at $\theta = 0$ and $\theta = \frac{\pi}{2}$ in
all cases where these values exist.

(c) Two right-angled triangles are shown in Figure 17. Use these to
calculate the values of $\sin\theta$, $\cos\theta$, $\tan\theta$ and $\cot\theta$ for $\theta$ equal to each of
$\frac{\pi}{6}$, $\frac{\pi}{4}$ and $\frac{\pi}{3}$.

(d) For what values of $\theta$ is $\sin\theta = 0$? (Refer to Figure 16.)

These functions are referred to
as secant, cosecant and
cotangent.

Figure 17 Two right-angled
triangles

The graphs of sec, cosec and cot
are given in the Handbook (as
well as those of sin, cos and tan).

The function tan has the graph shown in Figure 18. Notice that $\tan\theta$
actually repeats its values every $\pi$. (This is because $\sin(\theta + \pi) = -\sin\theta$
and $\cos(\theta + \pi) = -\cos\theta$, so that $\tan(\theta + \pi) = \tan\theta$.)

Figure 18 Graph of $y = \tan\theta$


3.2 Inverse trigonometric functions 


Diagnostic test 


Try Exercise 14. If your answer agrees with the solution, you may 
proceed quickly to Subsection 3.3. 


Suppose that you need to solve for $x$ the equation

$$\cos x = \tfrac{1}{2}.$$

What solutions are there? You have seen (in Exercise 13(c)) that
$\cos\frac{\pi}{3} = \frac{1}{2}$, so one solution is certainly $x = \frac{\pi}{3}$. There are others, however.
For instance, since cos repeats its values every $2\pi$, another solution is
$x = \frac{\pi}{3} + 2\pi$. We can find an infinite number of solutions by adding or
subtracting multiples of $2\pi$ to/from $\frac{\pi}{3}$. There are even more solutions. If
you look at the graph of cos in Figure 16, you can see that a horizontal line
at $y = \frac{1}{2}$ would cut it twice between 0 and $2\pi$: we also have $\cos\frac{5\pi}{3} = \frac{1}{2}$.
And more solutions can be found by adding or subtracting multiples of $2\pi$
to/from $\frac{5\pi}{3}$.


In general, an equation of the form

$$\cos x = y \tag{31}$$

is solved for $x$ by finding a value of the inverse trigonometric function
arccos:

$$x = \arccos y.$$

However, we need to be careful here. Solutions of equation (31) are not
unique, as we saw for $y = \frac{1}{2}$. If we reverse the roles of the axes for the
cosine curve in Figure 16, we obtain the curve shown in Figure 19.
However, this is not the graph of a function: a vertical line may meet the
curve in many places, reflecting the fact that for a given $y$, equation (31)
may have multiple solutions $x$. To ensure that $\arccos y$ has a unique value,
we need to restrict the range in which values of arccos can lie. The
restricted values of arccos are given in Table 1, together with those of two
other inverse trigonometric functions, arcsin and arctan. In Figure 19,
when the values taken by arccos are restricted, we obtain just the part of
the curve shown in blue, which is a valid graph for a function.


Figure 19 The graph of $x = \arccos y$ is just the blue part of the curve


Table 1 Domains and values of inverse trigonometric functions

Function        Inverse function    Domain of inverse function    Restricted values of inverse function
$y = \sin x$    $x = \arcsin y$     $-1 \le y \le 1$              $-\pi/2 \le x \le \pi/2$
$y = \cos x$    $x = \arccos y$     $-1 \le y \le 1$              $0 \le x \le \pi$
$y = \tan x$    $x = \arctan y$     $\mathbb{R}$                  $-\pi/2 < x < \pi/2$


Calculators and computer software can be expected to give values of the
inverse trigonometric functions drawn from suitably restricted values,
usually those in Table 1. However, these values are not always appropriate
in particular real-world problems. It is therefore important to be alert to
the fact that an equation such as $\cos x = y$ actually has infinitely many
solutions: if $|y| < 1$, there are two solutions in the range 0 to $2\pi$, together
with infinitely many others obtained by shifting these two by multiples
of $2\pi$.
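
The following Python sketch shows one way to list every solution of $\cos x = y$ in a given interval, starting from the restricted value returned by the library. It is an illustration under stated assumptions: the function name `cos_solutions` is hypothetical, `math.acos` is taken to return the restricted value between 0 and $\pi$ (as in Table 1), and the end points of the interval are treated as included.

```python
import math


def cos_solutions(y, lower, upper):
    """Return all x in [lower, upper] with cos(x) = y, in increasing order."""
    if not -1.0 <= y <= 1.0:
        return []  # cos x = y has no real solutions when |y| > 1
    principal = math.acos(y)  # the restricted value, between 0 and pi
    two_pi = 2.0 * math.pi
    solutions = set()
    # cos is even, so -principal is a solution whenever principal is.
    for base in (principal, -principal):
        # Try every multiple of 2*pi that could bring the solution into range.
        k_min = math.floor((lower - base) / two_pi)
        k_max = math.ceil((upper - base) / two_pi)
        for k in range(k_min, k_max + 1):
            x = base + k * two_pi
            if lower <= x <= upper:
                solutions.add(round(x, 12))  # rounding merges duplicates
    return sorted(solutions)


print(cos_solutions(0.5, 0.0, 4.0 * math.pi))
```

For $y = \frac{1}{2}$ on the interval from 0 to $4\pi$, this lists four values, corresponding to $\frac{\pi}{3}$, $\frac{5\pi}{3}$, $\frac{7\pi}{3}$ and $\frac{11\pi}{3}$.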


Exercise 14 
(a) Find all the solutions of $\sin\theta = 0.8$ in the range 0 to $6\pi$.
(b) Find all the solutions of $\tan\theta = 1$.

Some texts use $\sin^{-1}$, $\cos^{-1}$ and
$\tan^{-1}$ rather than arcsin, arccos
and arctan.

Graphs of arcsin and arctan, as
well as arccos, are given in the
Handbook.

For $|y| > 1$, $\cos x = y$ has no
solutions.

You will need a calculator in
order to get started.


3.3 Some useful trigonometric identities 


Diagnostic test 


Try Exercises 15(c), 16(a), 16(b) and 16(g). If your answers agree 
with the solutions, you may proceed quickly to Section 4. 


Figure 20 shows the relation between a clockwise rotation through $\theta$
(regarded as a rotation through $-\theta$) and an anticlockwise rotation
through $\theta$. Notice that such rotations lead to equal $x$-coordinates but to
$y$-coordinates of opposite signs. So we have

$$\cos(-\theta) = \cos\theta \quad\text{and}\quad \sin(-\theta) = -\sin\theta.$$

These relations hold for all values of $\theta$, and are examples of
trigonometric identities. Such identities can be useful in a variety of
contexts, such as simplifying expressions involving trigonometric functions
or evaluating integrals.


Figure 20 The effects of positive (anticlockwise) and negative (clockwise)
rotations


There are three particularly useful trigonometric identities that you should
remember.

$$\cos^2\theta + \sin^2\theta = 1, \tag{32}$$

$$\sin(\theta \pm \phi) = \sin\theta\cos\phi \pm \cos\theta\sin\phi, \tag{33}$$

$$\cos(\theta \pm \phi) = \cos\theta\cos\phi \mp \sin\theta\sin\phi. \tag{34}$$


Other useful trigonometric identities are given below. You should be aware 
of these identities, but there is no need to remember them, as they can all 
be very easily derived from equations (32)—(34) in one or two lines (see 
Exercise 15). 


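$$1 + \tan^2\theta = \sec^2\theta, \tag{35}$$

$$\cot^2\theta + 1 = \operatorname{cosec}^2\theta; \tag{36}$$

$$\tan(\theta \pm \phi) = \frac{\tan\theta \pm \tan\phi}{1 \mp \tan\theta\tan\phi}; \tag{37}$$

$$\sin\theta\sin\phi = \tfrac{1}{2}[\cos(\theta - \phi) - \cos(\theta + \phi)], \tag{38}$$

$$\cos\theta\cos\phi = \tfrac{1}{2}[\cos(\theta - \phi) + \cos(\theta + \phi)], \tag{39}$$

$$\sin\theta\cos\phi = \tfrac{1}{2}[\sin(\theta + \phi) + \sin(\theta - \phi)]; \tag{40}$$

$$\sin(2\theta) = 2\sin\theta\cos\theta, \tag{41}$$

$$\cos(2\theta) = \cos^2\theta - \sin^2\theta = 2\cos^2\theta - 1 = 1 - 2\sin^2\theta, \tag{42}$$

$$\tan(2\theta) = \frac{2\tan\theta}{1 - \tan^2\theta}. \tag{43}$$

All the trigonometric identities
discussed in this subsection are
included in the module
Handbook for easy reference.
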
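
As an illustration of how short such derivations are, here is a sketch of how equation (42) follows from equations (32) and (34). It takes the sum form of the cosine addition formula, $\cos(\theta + \phi) = \cos\theta\cos\phi - \sin\theta\sin\phi$, as the starting point and sets $\phi = \theta$.

```latex
\begin{align*}
\cos(2\theta) &= \cos(\theta + \theta)
   = \cos\theta\cos\theta - \sin\theta\sin\theta
   = \cos^2\theta - \sin^2\theta
   && \text{(equation (34) with } \phi = \theta\text{)} \\
 &= \cos^2\theta - (1 - \cos^2\theta) = 2\cos^2\theta - 1
   && \text{(using } \sin^2\theta = 1 - \cos^2\theta \text{ from (32))} \\
 &= 2(1 - \sin^2\theta) - 1 = 1 - 2\sin^2\theta
   && \text{(using } \cos^2\theta = 1 - \sin^2\theta \text{ from (32))}
\end{align*}
```

The same pattern, choosing particular angles in equations (33) and (34) and tidying up with equation (32), yields the other identities in the list.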
Exercise 15 


Use equations (32)—(34) to derive the following identities. 
(a) Equation (35) (b) Equation (37) 
(c) Equation (38) (d) Equation (41) 


Exercise 16 


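Use equations (33) and (34), and particular values of sin and cos, to
simplify each of the following.

(a) $\sin(2\pi - \theta)$ (b) $\sin\left(\frac{\pi}{2} - \theta\right)$ (c) $\sin(\pi - \theta)$
(d) $\cos(\pi - \theta)$ (e) $\cos(2\pi - \theta)$ (f) $\cos\left(\frac{\pi}{2} - \theta\right)$ (g) $\cos\left(\frac{3\pi}{2} + \theta\right)$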


Taylor series for trigonometric functions 


The sine and cosine functions can be expressed as infinite series:

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \frac{x^9}{9!} - \cdots, \tag{44}$$

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \frac{x^8}{8!} - \cdots. \tag{45}$$

Note that these equations are
valid only in radians.

Because the factorial $n!$ becomes very large as $n$ increases, these series sum
to a finite number, no matter how large $x$ is. They are the Taylor series
for the sine and cosine functions. A frequent use of these expressions
is to approximate $\sin\theta$ and $\cos\theta$ when $\theta$ is small.

$$\sin\theta \simeq \theta \quad (\text{for } |\theta| \ll 1 \text{ in radians}), \tag{46}$$

$$\cos\theta \simeq 1 - \tfrac{1}{2}\theta^2 \quad (\text{for } |\theta| \ll 1 \text{ in radians}). \tag{47}$$

The symbols $\ll$ and $\gg$ mean
‘much smaller than’ and ‘much
larger than’, respectively.
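
A quick numerical experiment shows how good the small-angle approximations are. The Python sketch below is illustrative only: the function name `small_angle_errors` and the sample angles are choices made here, and all angles are in radians.

```python
import math


def small_angle_errors(theta):
    """Return the absolute errors of approximations (46) and (47) at theta."""
    sin_error = abs(math.sin(theta) - theta)
    cos_error = abs(math.cos(theta) - (1 - theta ** 2 / 2))
    return sin_error, cos_error


for theta in (0.5, 0.1, 0.01):
    print(theta, small_angle_errors(theta))
```

The errors shrink rapidly as $\theta$ decreases, because the first neglected terms in series (44) and (45) are $\theta^3/3!$ and $\theta^4/4!$ respectively.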


Engineers commonly use $j$ to
represent $\sqrt{-1}$.

$\operatorname{Im}(z)$ is the real number $b$; it is
not equal to $bi$.

An $n$th-order polynomial is
sometimes referred to as a
polynomial of degree $n$.

In fact, this result also holds if
the coefficients $a_k$ are complex.

Repeated roots are sometimes
referred to as equal roots or
coincident roots.


4 Complex numbers 


Diagnostic test 


Try Exercises 17(a), 17(b), 17(e) and 18. If your answers agree with 
the solutions, you may proceed quickly to Subsection 4.2. 


Complex numbers provide a system within which we can solve any 
quadratic equation (and, indeed, any polynomial equation). They are used 
in many of the mathematical techniques introduced in this module. 


There is no real number $x$ satisfying the equation

$$x^2 = -1.$$

However, there are circumstances where it is convenient to have a system
of ‘numbers’ in which such an equation can be solved. Such a system is
provided by the complex numbers. A complex number is one of the form

$$z = a + bi \quad (\text{or equivalently, } z = a + ib),$$

where $i = \sqrt{-1}$, and $a$ and $b$ are real numbers. We refer to $a$ as the real
part of $z$, written $\operatorname{Re}(z)$, and to $b$ as the imaginary part of $z$, written
$\operatorname{Im}(z)$. A complex number of the form $a + 0i$ is, in effect, just the real
number $a$; so the real numbers are seen as a subset of the complex
numbers.


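The set of all complex numbers is denoted by $\mathbb{C}$. Within $\mathbb{C}$, we can solve
any quadratic equation. For example, the equation $x^2 - 2x + 2 = 0$ has the
solutions

$$x = \frac{2 \pm \sqrt{2^2 - 4 \times 2}}{2} = \frac{2 \pm \sqrt{-4}}{2} = \frac{2 \pm \sqrt{4} \times \sqrt{-1}}{2} = \frac{2 \pm 2i}{2} = 1 \pm i,$$

and the equation $x^2 = -1$ has the solutions $x = \pm i$.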
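
Python's standard `cmath` module provides a complex square root, so the quadratic formula can be applied even when the discriminant is negative. The sketch below is illustrative: the function name `quadratic_roots` is hypothetical, and Python writes the imaginary unit as `j` rather than `i`.

```python
import cmath


def quadratic_roots(a, b, c):
    """Return both roots of a*x**2 + b*x + c = 0, allowing complex roots."""
    if a == 0:
        raise ValueError("a must be non-zero for a quadratic equation")
    root = cmath.sqrt(b * b - 4 * a * c)  # complex square root of the discriminant
    return (-b + root) / (2 * a), (-b - root) / (2 * a)


print(quadratic_roots(1, -2, 2))  # the equation x**2 - 2x + 2 = 0
print(quadratic_roots(1, 0, 1))   # the equation x**2 = -1
```

The first call returns the pair $1 \pm i$ and the second returns $\pm i$, matching the solutions found by hand.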


An $n$th-order polynomial with real coefficients is a function of the form

$$p(x) = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0,$$

where $a_n \neq 0$ and each coefficient $a_k$ ($k = 0, 1, \ldots, n$) is a constant in $\mathbb{R}$.

If $x$ is allowed to be complex, any such polynomial can be written as a
product of the form

$$p(x) = a_n (x - c_1)(x - c_2) \cdots (x - c_n),$$

where each $c_k$ ($k = 1, 2, \ldots, n$) is a complex number. Remember that real
numbers are included in the complex numbers, so this does not prevent
some (or all) of the $c_k$ being real. The equation $p(x) = 0$ then has
$n$ solutions: $x = c_1$, $x = c_2$, ..., $x = c_n$. These are called the roots of the
polynomial. If a factor $x - c$ occurs more than once, then the root $c$ is a
repeated root. For example, the polynomial $x^2 - 2cx + c^2 = (x - c)^2$ has
the repeated root $c$.




4.1 The arithmetic of complex numbers 


We can perform arithmetic with complex numbers, and this follows all the
familiar rules for real numbers, such as

$$u(v + w) = uv + uw \quad\text{and}\quad u \times v = v \times u.$$

To add, subtract or multiply complex numbers, just manipulate brackets
in the usual way, and remember that $i^2 = -1$. For example,

$$(2 + 3i) + (4 - 7i) = 2 + 4 + 3i - 7i = 6 - 4i$$

and

$$\begin{aligned}
(2 + 3i) \times (4 - 7i) &= 2 \times (4 - 7i) + 3i \times (4 - 7i) \\
&= 8 - 14i + 12i - 21i^2 \\
&= 8 - 2i + 21 \\
&= 29 - 2i.
\end{aligned}$$
Division of complex numbers is a little more complicated. It is best
described in terms of the complex conjugate, which is defined as follows:
if $z = a + bi$ is a complex number, then the complex conjugate of $z$ is

$$\bar{z} = a - bi.$$

You will see the notation $z^*$ for
the complex conjugate of $z$ in
many texts.

Then, to divide one complex number by another, as in $u/v$, we multiply
top and bottom by the complex conjugate $\bar{v}$ of the denominator. For
example, to simplify

$$\frac{2 + 3i}{4 - 7i}$$

we multiply top and bottom by $4 + 7i$, the complex conjugate of the
denominator, to obtain

$$\begin{aligned}
\frac{2 + 3i}{4 - 7i} &= \frac{(2 + 3i) \times (4 + 7i)}{(4 - 7i) \times (4 + 7i)} \\
&= \frac{8 + 14i + 12i - 21}{16 + 28i - 28i + 49} \\
&= \frac{-13 + 26i}{65} = -\frac{1}{5} + \frac{2}{5}i.
\end{aligned}$$

This process always reduces the denominator to a real number because the
product of a complex number $a + bi$ and its complex conjugate $a - bi$ is
always real:

$$(a + bi) \times (a - bi) = a^2 + b^2.$$

Note that $a^2 + b^2$ is always
positive, unless $a = b = 0$.

Given any complex number $z = a + bi$, its modulus $|z|$ is defined to be
$\sqrt{a^2 + b^2}$, so we have

$$|z|^2 = a^2 + b^2 = z\bar{z}.$$

The process of dividing one complex number, $u = a + bi$, by another,
$v = c + di$, is then summarised by

$$\frac{u}{v} = \frac{u\bar{v}}{v\bar{v}} = \frac{u\bar{v}}{|v|^2} = \frac{(a + bi) \times (c - di)}{c^2 + d^2}.$$

Exercise 17 

Let $v = 3 - 4i$ and $w = 2 - i$. Evaluate each of the following.
(a) $\bar{v}$ (b) $|v|$ (c) $v - w$ (d) $vw$
(e) $w/v$ (f) $1/w$ (g) $w^2$ (h) $2w - 3v$


Exercise 18 


Solve (for $x$ in $\mathbb{C}$) the quadratic equation $2x^2 + 2x + 1 = 0$.


Notice that the solutions obtained in Exercise 18 are complex conjugates of
one another. This is not accidental: whenever a quadratic function with real
coefficients has roots that are not real, those roots are a pair of complex
conjugates (of the form $a \pm bi$). This follows directly from equation (9) for
the solution of a quadratic equation.


4.2 Polar form of complex numbers 


Diagnostic test 


Try Exercises 21(a), 21(c) and 21(f). If your answers agree with the 
solutions, you may proceed quickly to Subsection 4.3. 




Before discussing the polar form of a complex number, here is a brief 
review of polar coordinates. 


Polar coordinates 


Polar coordinates provide an alternative way of representing points in
the plane. Figure 21 shows a point $A$ with Cartesian coordinates $(x, y)$ and
polar coordinates $(r, \theta)$. The quantity $r$ is the distance from the origin
to $A$, so $r \ge 0$. The angle $\theta$ is measured anticlockwise from the positive
$x$-axis. (Negative angles correspond to measuring clockwise from the
positive $x$-axis.)

Figure 21 Cartesian $(x, y)$ and polar $(r, \theta)$ coordinates of a point $A$

It is convenient to allow $\theta$ to take any real value, but this has the
consequence that the polar representation of a point is not unique. For
example, $(r, \theta)$ and $(r, \theta + 2\pi)$ provide polar coordinates of the same point.
We can see from Figure 21 that if a point has polar coordinates $(r, \theta)$ and
Cartesian coordinates $(x, y)$, then

$$x = r\cos\theta \quad\text{and}\quad y = r\sin\theta. \tag{49}$$

These equations allow us to translate from polar to Cartesian coordinates.
To translate from Cartesian to polar coordinates, we can use (see
Figure 21)

$$r = \sqrt{x^2 + y^2}, \quad \cos\theta = x/r, \quad \sin\theta = y/r \quad (r \neq 0). \tag{50}$$

If $r = 0$, then we can choose any
value for $\theta$.

Equations (50) do not have a unique solution for $\theta$ in $\mathbb{R}$, but they do have
a unique solution in the range $-\pi < \theta \le \pi$.
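
In code, the translation between the two coordinate systems might look like the following Python sketch. The function names `to_polar` and `to_cartesian` are hypothetical. The library function `math.atan2(y, x)` is used to pick the angle that satisfies both parts of equations (50); one caveat is that for the floating-point value $y = -0.0$ with $x < 0$ it returns $-\pi$ rather than $\pi$.

```python
import math


def to_polar(x, y):
    """Return (r, theta) with r >= 0 and theta between -pi and pi."""
    r = math.hypot(x, y)      # sqrt(x**2 + y**2)
    theta = math.atan2(y, x)  # angle with cos(theta) = x/r and sin(theta) = y/r
    return r, theta


def to_cartesian(r, theta):
    """Apply equations (49): x = r cos(theta), y = r sin(theta)."""
    return r * math.cos(theta), r * math.sin(theta)


r, theta = to_polar(-1.0, 1.0)
print(r, theta)
print(to_cartesian(r, theta))
```

Using `atan2` rather than an inverse cosine or inverse tangent alone avoids having to work out by hand which quadrant the point lies in.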


Exercise 19 
Locate each of the following points (x,y) on a diagram like Figure 21. 
(2.0), Gi, l=) C0, 4) (73,0: 


Hence find the polar coordinates $(r, \theta)$ of each point, for $\theta$ in the range
$-\pi < \theta \le \pi$.




Figure 22 Argand diagram showing the point $3 + 2i$


Polar form of a complex number 


A complex number $x + yi$ can be represented geometrically by treating its
real and imaginary parts as Cartesian coordinates $(x, y)$ in a plane. This
produces a diagram known as an Argand diagram. For example,
Figure 22 shows on an Argand diagram the point $3 + 2i$, with real part 3
and imaginary part 2.


By combining polar coordinates with the Argand diagram, we obtain the
polar form of a complex number. For a complex number $z = x + yi$, we
take the Cartesian coordinates $(x, y)$ and convert them to the
corresponding polar coordinates $(r, \theta)$. Then, using the relation between
polar and Cartesian coordinates, we have

$$z = x + yi = x + iy = r\cos\theta + i\,r\sin\theta = r(\cos\theta + i\sin\theta). \tag{51}$$

This is the polar form of $z$. Here, $r = \sqrt{x^2 + y^2} = |z|$ is the modulus
of $z$, and $\theta$ is an argument of $z$. As noted above, $\theta$ is not unique, but
there is a unique value of $\theta$ in the range $-\pi < \theta \le \pi$. This is called the
principal value of the argument, and is written as $\mathrm{Arg}(z)$. When there is
no possibility of confusion, we often write $(r, \theta)$ as shorthand for the polar
form $r(\cos\theta + i\sin\theta)$.


Exercise 20 


a 


If a complex number z has polar form (2, — *), what is its Cartesian form? 


Exercise 21 


Express each of the following complex numbers in polar form, choosing the 
principal value of the argument. 


(a) -2 (b) 1+i (c) -1-i (d) 4 (e) 4é = (f) -V34+i 


4.3 Euler’s formula 


Diagnostic test 


Try Exercises 23 and 24. If your answers agree with the solutions, you 
may proceed quickly to Section 5. 


The definition of the exponential function, equation (18), remains valid 
even when x is a complex number. There is a remarkable result that 
connects the exponential function and trigonometric functions, known as 
Euler’s formula. If x is a real number, Euler’s formula states that

$$\exp(ix) = \cos x + i\sin x. \tag{52}$$

To see why this is true, replace x in equation (18) by ix. This gives the
Taylor series for $\exp(ix)$:

$$\exp(ix) = 1 + ix - \frac{x^2}{2!} - i\frac{x^3}{3!} + \frac{x^4}{4!} + i\frac{x^5}{5!} - \frac{x^6}{6!} - \cdots. \tag{53}$$

(Note that the definition $i^2 = -1$ gives $i^3 = -i$, $i^4 = 1$, $i^5 = i$, $i^6 = -1$, etc.)

Collecting together the real and imaginary parts, we see that they
correspond to the Taylor expansions of $\cos x$ and $\sin x$ (equations (44)
and (45)) introduced at the end of Subsection 3.3:

$$\exp(ix) = \left(1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots\right) + i\left(x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots\right).$$

Hence

$$\exp(ix) = \cos x + i\sin x, \quad\text{or equivalently,}\quad e^{ix} = \cos x + i\sin x.$$
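The series argument can also be checked numerically. The sketch below assumes that equation (18) is the usual power series $1 + z + z^2/2! + \cdots$ for the exponential function; the function name `exp_series` and the choice of 30 terms are illustrative assumptions, not part of the module.

```python
import math

def exp_series(z, terms=30):
    """Partial sum 1 + z + z^2/2! + ... of the exponential series."""
    total = 0j
    term = 1 + 0j               # current term z^n / n!, starting with n = 0
    for n in range(terms):
        total += term
        term *= z / (n + 1)     # next term: multiply by z / (n + 1)
    return total

x = 1.2
series_value = exp_series(1j * x)                 # series evaluated at ix
euler_value = complex(math.cos(x), math.sin(x))   # right-hand side of Euler's formula
difference = abs(series_value - euler_value)      # should be close to rounding error
```

For modest values of x the two sides agree to within floating-point rounding, which is consistent with (though of course not a proof of) Euler’s formula.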


‘Our jewel’ 


Euler’s formula is a very interesting result because it connects
exponential functions, which are easy to deal with mathematically,
with trigonometric functions, which occur in descriptions of waves,
oscillations and vibrations. Richard Feynman (Figure 23), who was a
noted educator as well as a Nobel prizewinner for his work on quantum
electrodynamics, referred to Euler’s formula as ‘our jewel’.

Figure 23 Richard Feynman (1918-1988), Nobel laureate and bongo player

One of the most common uses of Euler’s formula is in the description
of wave motion (Figure 24). A wave, such as an undulation of the
height h of the surface of some water at position x and time t, might
be described by a function such as

$$h(x,t) = A\cos(kx - \omega t),$$

where $k$, $\omega$ and $A$ are real constants. In advanced calculations
involving wave motion (including the ones that made Richard
Feynman famous), a wave is often represented by a complex function
of the form

$$H(x,t) = A\exp[i(kx - \omega t)],$$

whose real part is given by the equation for $h(x,t)$ above. It is often
convenient to use this exponential form, and then take the real part
at the end of the calculation.


Figure 24 Euler’s formula can help us to understand subtle phenomena involving
wave motion, such as this pattern caused by the partial focusing of light waves


Exponential form of complex numbers

Combining equation (51) for the polar form of a complex number with
Euler’s formula, equation (52), it follows that any complex number can be
written as

$$z = x + iy = r\exp(i\theta),$$

where $r$ and $\theta$ have the same values as in the polar form. This alternative
way of expressing a complex number, the exponential form, is
particularly useful when we need to multiply and divide complex numbers.


Note that although $\theta_1 + \theta_2$ is an argument of the product $z_1 z_2$ in
equation (54), it may not be the principal value of the argument.

To avoid problems, $n$ must be an integer in de Moivre’s theorem.


In this form, $r$ is the modulus of $z$, and $\theta$ is an argument of $z$. As with the
polar form, the value of $\theta$ is not unique, but there is a unique choice of $\theta$ in
the range $-\pi < \theta \le \pi$.


Exercise 22 


Express the following complex numbers as complex exponentials with
arguments in the range $-\pi < \theta \le \pi$.


(a) zy = 3e—t7n/2 (b) ga ellin 


Exercise 23 


A complex number has polar form z = Ge 0). Use the exponential form of z 
to find Re(ze’), i.e. the real part of ze’, where w and ¢ are real. 


Multiplication, division and powers revisited 


Multiplication and division of complex numbers is far simpler in
exponential and polar forms than in Cartesian form. If
$z_1 = x_1 + iy_1 = r_1 e^{i\theta_1}$ and $z_2 = x_2 + iy_2 = r_2 e^{i\theta_2}$, then

$$z_1 z_2 = (r_1 e^{i\theta_1})(r_2 e^{i\theta_2}) = (r_1 r_2)\,e^{i(\theta_1 + \theta_2)}. \tag{54}$$

So to multiply two numbers in exponential form, we just multiply their
moduli and add their arguments. In polar form this rule is written as

$$r_1(\cos\theta_1 + i\sin\theta_1) \times r_2(\cos\theta_2 + i\sin\theta_2) = r_1 r_2\bigl(\cos(\theta_1 + \theta_2) + i\sin(\theta_1 + \theta_2)\bigr),$$

or, in shorthand notation,

$$(r_1, \theta_1) \times (r_2, \theta_2) = (r_1 r_2,\ \theta_1 + \theta_2).$$

Similarly, for division of complex numbers,

$$\frac{z_1}{z_2} = \frac{r_1 e^{i\theta_1}}{r_2 e^{i\theta_2}} = \frac{r_1}{r_2}\,e^{i(\theta_1 - \theta_2)}. \tag{55}$$

So to divide two numbers in exponential form, we just divide their moduli
and subtract their arguments. In shorthand polar form this means that

$$(r_1, \theta_1) \div (r_2, \theta_2) = (r_1/r_2,\ \theta_1 - \theta_2).$$

Further, if we multiply the complex number $z = re^{i\theta}$ by itself repeatedly,
we obtain a formula for an integer power of a complex number:

$$z^n = (re^{i\theta})^n = r^n e^{in\theta} \tag{56}$$

or

$$(r, \theta)^n = (r^n, n\theta).$$

For a complex number of unit modulus (i.e. one with $r = 1$), we have

$$(\cos\theta + i\sin\theta)^n = \cos(n\theta) + i\sin(n\theta). \tag{57}$$

This result is known as de Moivre’s theorem (after Abraham de Moivre,
1667-1754).
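The multiplication rule and de Moivre’s theorem can be illustrated with Python’s standard `cmath` module. This is a sketch with arbitrarily chosen moduli and arguments; note that `cmath.polar` returns the principal value of the argument, so the sum of the two arguments may differ from it by $2\pi$.

```python
import cmath
import math

# Two complex numbers given by modulus and argument.
z1 = cmath.rect(2.0, math.pi / 3)        # modulus 2, argument pi/3
z2 = cmath.rect(3.0, 3 * math.pi / 4)    # modulus 3, argument 3*pi/4

# Product: moduli multiply, arguments add.
r, theta = cmath.polar(z1 * z2)          # theta is the principal value
theta_sum = math.pi / 3 + 3 * math.pi / 4   # 13*pi/12, which exceeds pi

# de Moivre's theorem for an integer power n.
n, t = 5, 0.7
lhs = complex(math.cos(t), math.sin(t)) ** n
rhs = complex(math.cos(n * t), math.sin(n * t))
```

Here the modulus of the product is 6, while the principal argument is the sum of the arguments reduced by $2\pi$. This is an instance of the remark that the sum of two arguments is an argument of the product but need not be its principal value.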




Exercise 24 


Express 1 — i in exponential form and hence simplify (1 — 7)?°. 


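Euler’s formula also leads to useful expressions for the trigonometric
functions in terms of complex exponentials. A complex number of unit
modulus can be written as

$$e^{i\theta} = \cos\theta + i\sin\theta,$$

and its complex conjugate is

$$e^{-i\theta} = \cos\theta - i\sin\theta.$$

(The complex conjugate is obtained by changing the sign of $i$ wherever it appears.)

Adding and subtracting these two equations then gives the results

$$\cos\theta = \frac{1}{2}\left(e^{i\theta} + e^{-i\theta}\right), \tag{58}$$

$$\sin\theta = \frac{1}{2i}\left(e^{i\theta} - e^{-i\theta}\right). \tag{59}$$

These formulas find many applications. As an example, let us derive a
trigonometric identity relating $\cos^2\theta$ to $\cos 2\theta$. Using equation (58),
together with the multiplication rule in equation (54), we have

$$\begin{aligned}
\cos^2\theta &= \tfrac{1}{4}\left(e^{i\theta} + e^{-i\theta}\right)^2 \\
&= \tfrac{1}{4}\left(e^{i\theta}e^{i\theta} + 2e^{i\theta}e^{-i\theta} + e^{-i\theta}e^{-i\theta}\right) \\
&= \tfrac{1}{4}\left(e^{2i\theta} + 2 + e^{-2i\theta}\right) \\
&= \tfrac{1}{2}\cdot\tfrac{1}{2}\left(e^{2i\theta} + e^{-2i\theta}\right) + \tfrac{1}{2} \\
&= \tfrac{1}{2}\left[\cos(2\theta) + 1\right].
\end{aligned}$$

This is a rearranged version of equation (42).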
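As a further illustration (not part of the original text), the same approach applied to equation (59) gives the companion identity for $\sin^2\theta$. The working below is a sketch that uses only equations (58) and (59) and the fact that $(2i)^2 = -4$:

```latex
\begin{aligned}
\sin^2\theta &= \frac{1}{(2i)^2}\left(e^{i\theta} - e^{-i\theta}\right)^2 \\
&= -\tfrac{1}{4}\left(e^{2i\theta} - 2 + e^{-2i\theta}\right) \\
&= -\tfrac{1}{4}\left(2\cos(2\theta) - 2\right) \\
&= \tfrac{1}{2}\left[1 - \cos(2\theta)\right].
\end{aligned}
```

Adding this to the result for $\cos^2\theta$ gives $\cos^2\theta + \sin^2\theta = 1$, which is a useful check on both derivations.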


5 Differentiation 


The concepts and techniques of calculus are central to many of the 
mathematical methods discussed in this module. This section considers 
differentiation. 


Differentiation is fundamental to mechanics: if an object moving in a
straight line has position $x(t)$ at time $t$, then its velocity at time $t$ is
given by the derivative

$$v = \frac{dx}{dt},$$

and its acceleration at time $t$ is given by

$$a = \frac{dv}{dt} = \frac{d^2x}{dt^2}.$$

(You may use the Handbook to look up standard derivatives.)

Figure 25 A graph of height $y$ against time $t$ for a ball thrown upwards


5.1 Derivative as a rate of change 


Diagnostic test 


Try Exercises 27(d) and 27(e). If your answers agree with the 
solutions, you may proceed quickly to Subsection 5.2. 


Differentiation gives the rate of change of one variable with respect to 
another. For example, consider the function (introduced in Subsection 1.1) 


V(t) =2 x 10° — 15t, (60) 


where V(t) is the volume of water in a reservoir (in cubic metres) at time t 
(in minutes, from midday on 1 June). In this case, the rate of change of V 
with respect to t is —15 (in cubic metres per minute). This is negative 
because the volume is decreasing, and it is constant because the volume of 
water is falling by the same amount each minute. For linear functions, 
such as this, the rate of change is always constant. This corresponds to the 
fact that a linear function has a straight-line graph whose gradient (or 
slope) is the same everywhere. 


Now consider a non-linear function, such as

$$y(t) = -4.9t^2 + 10t + 2 \tag{61}$$

(introduced in Subsection 1.3), which gives the height (y metres at time
t seconds) of a ball thrown vertically upwards with an initial speed of
10 metres per second, from an initial height of 2 metres. In this case, the
rate of change of height with respect to time is the vertical velocity v of
the ball (which is positive when the ball is moving upwards, and negative
when it is moving downwards). Differentiation allows us to show that the
velocity of the ball (in metres per second) is

$$v(t) = -9.8t + 10. \tag{62}$$


This is a function of time: the velocity of the ball is positive to begin with,
and then becomes negative as the ball falls back to the ground. This
corresponds to the fact that the quadratic function in equation (61) has a
parabolic graph (Figure 25), with a gradient that varies from point to
point. The process that takes us from the function $y(t)$ in equation (61) to
the function $v(t)$ in equation (62) is called differentiation. The function
$v(t)$ is called the derivative or derived function of $y(t)$. This function is
denoted by $dy/dt$ or $y'(t)$, so in the present case

$$\frac{dy}{dt} = y'(t) = -9.8t + 10.$$

At each value of $t$, this derivative gives the gradient of the graph of $y$
against $t$.


More generally, given a function $f(x)$, the gradient of the graph of $f(x)$
against $x$ at a particular point $x = x_0$ is equal to the derivative $f'(x_0)$ of the
function $f$ at that point. This leads to a definition of the derivative, based
on Figure 26.

Figure 26 The gradient of $f$ at $x_0$, defined as the limiting value of the
gradient of the chord AB

The gradient of the graph $y = f(x)$ at $x = x_0$ is defined as the limiting
value of the gradient of the chord AB in Figure 26, as B approaches A.
The gradient of this chord is $(f(x_0 + h) - f(x_0))/h$, and the process of
allowing B to approach A corresponds to $h$ tending to 0, often written as
$h \to 0$. Hence the derivative of $f$ at $x_0$ may be formally defined as follows.

Definition
The derivative of a function $f(x)$ at $x = x_0$ is

$$f'(x_0) = \lim_{h \to 0} \frac{f(x_0 + h) - f(x_0)}{h}.$$

(This definition assumes that the limit exists and is the same
whether $h$ approaches 0 through positive or negative values.
When these conditions are not met, the function cannot be
differentiated at $x_0$.)
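The limiting process in this definition can be explored numerically. The sketch below evaluates the gradient of the chord for the ball’s height function from equation (61) at the arbitrarily chosen time t = 0.5, using successively smaller values of h (one of them negative); the function names are illustrative.

```python
def y(t):
    """Height of the ball from equation (61), in metres."""
    return -4.9 * t**2 + 10 * t + 2

def difference_quotient(f, x0, h):
    """Gradient of the chord joining the points at x0 and x0 + h."""
    return (f(x0 + h) - f(x0)) / h

t0 = 0.5
exact = -9.8 * t0 + 10   # value given by equation (62)

# Chord gradients for shrinking h, approaching from both sides.
estimates = [difference_quotient(y, t0, h) for h in (0.1, 0.01, 0.001, -0.001)]
```

For this quadratic function the chord gradient differs from the derivative by exactly $-4.9h$, so the estimates approach the value from equation (62) as h shrinks, whether h is positive or negative.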


Working from this definition, it is possible to obtain formulas for the
derivatives of standard functions. You need not be concerned with the
sometimes lengthy details. For our purposes, it is enough to know that all
the basic derivatives that we need are tabulated in the module Handbook,
and can be used as required. As usual, it helps to remember the most
frequently used results, such as the derivatives of $x^a$, $\sin x$, $\exp(x)$ and $\ln x$,
but this generally comes with practice rather than rote memorisation.
Note that only the basic derivatives are tabulated. The Handbook does not give
the derivative of every function that you might encounter. Generally, you
need to combine two elements:

• derivatives of standard functions

• rules for differentiating combinations of functions of various types, in
sums, products, quotients and compositions.


The skill in differentiation lies mostly in applying the rules. The simplest
rule concerns constant multiples and sums. In general, the derivative of a
combination $a f(x) + b g(x)$, where $a$ and $b$ are constants, is given by

$$\frac{d}{dx}\bigl[a f(x) + b g(x)\bigr] = a\frac{df}{dx} + b\frac{dg}{dx}. \tag{63}$$

(In text, $\dfrac{dy}{dt}$ may be written as $dy/dt$, to save space.)

To illustrate this, let us return to equation (61):

$$y(t) = -4.9t^2 + 10t + 2.$$

Applying patterns for derivatives given in the Handbook, we see that

$$\frac{d}{dt}(t^2) = 2t, \quad \frac{d}{dt}(t) = 1 \quad\text{and}\quad \frac{d}{dt}(\text{constant}) = 0.$$

Hence, applying the rule in equation (63), we get

$$\begin{aligned}
\frac{dy}{dt} &= -4.9\frac{d}{dt}(t^2) + 10\frac{d}{dt}(t) + \frac{d}{dt}(2) \\
&= -4.9(2t) + 10(1) + 0 \\
&= -9.8t + 10.
\end{aligned}$$


A similar process applied to the function in equation (60) gives 
dV /dt = —15, a constant negative value corresponding to the fact that V 
decreases at a constant rate. 


Notation for derivatives 


There are various notations for derivatives, some of which we have used 
above. We will use whichever is convenient in a particular context. 


Notation expressed purely in terms of variables, such as dy/dt, is referred 
to as Leibniz notation (after its inventor, Gottfried Wilhelm Leibniz 
(1646-1716)). This notation is extended to write, for example, 


“(ai +5sin2t) or (a2 + ba”). 


It is implicit that, in general, $dy/dt$ is a function of $t$, but this fact may
sometimes be emphasised by writing $\dfrac{dy}{dt}(t)$. In Leibniz notation, the value
of a derivative at a particular point, such as $t = 3$, is written as
$\left.\dfrac{dy}{dt}\right|_{t=3}$.

The alternative to Leibniz notation is function notation, where
differentiation is indicated by adding a prime (′) to the function name. So
the derivative of $f(x)$ is $f'(x)$. This clearly shows that the derivative is a
function of $x$, and the value of the derivative at $x = 3$ is written as $f'(3)$.
To give another example, the derivative of $h(t)$ is $h'(t)$, and the value of
this derivative at $t = 4$ is $h'(4)$. We must take care to use this notation
correctly. The symbol $y'$, for example, means the derivative of the
function $y$, but could lead to confusion if $y$ is being used for both $y(t)$
and $y(x)$.


Because the derivative of a function is itself a function, it can be
differentiated again. For example, if $y(x) = x^3 + 5x$, then

$$\frac{dy}{dx} = y'(x) = 3x^2 + 5. \tag{64}$$

This derivative is itself a function of $x$, and can be differentiated again to
obtain the second derivative of $y(x)$. In Leibniz notation, we write the
second derivative as $d^2y/dx^2$. In function notation, it is written with two
primes, as $y''(x)$. So the derivative of equation (64) is

$$\frac{d}{dx}\left(\frac{dy}{dx}\right) = \frac{d^2y}{dx^2} = y''(x) = 6x.$$

The value of this derivative at $x = 4$ is then denoted by

$$\left.\frac{d^2y}{dx^2}\right|_{x=4} = 24 \quad\text{or}\quad y''(4) = 24.$$

Differentiating again gives the third derivative, written $\dfrac{d^3y}{dx^3}$ or $y'''(x)$,
which may also be written $y^{(3)}(x)$. For equation (64), $y'''(x) = 6$. The
process can be continued, and a general $n$th derivative is written as $\dfrac{d^ny}{dx^n}$
or $y^{(n)}(x)$, where $n$ is referred to as the order of the derivative.


There is one final piece of notation to mention. Time (habitually denoted
by $t$) is so often the independent variable that there is a separate
notational convention for differentiation with respect to it. A dot is placed
over the dependent variable to indicate a first derivative with respect to $t$,
and two dots to indicate a second derivative. So if $x(t)$ is the position of an
object as a function of time $t$, then $\dot{x}$ means the same as $dx/dt$ or $x'(t)$,
while $\ddot{x}$ means the same as $d^2x/dt^2$ or $x''(t)$. As noted at the beginning of
this section, these derivatives represent the velocity and acceleration of a
particle in straight-line motion.


The following exercises offer practice in differentiating standard functions, 
and constant multiples and sums of these. Refer to the Handbook for 
standard derivatives if necessary. 


Exercise 25 


Suppose that an object is moving in a straight line so that its position x 
(measured from a chosen origin) is related to time t by the equation 


x = 5+ 7cos(3t). 


Find expressions in terms of $t$ for the velocity $\dot{x}$ and acceleration $\ddot{x}$ of the
object.


Exercise 26 


The weekly wage bill of a company, t years in the future, is projected to be 
B pounds sterling, where 


B = 10° exp(0.04t). 


Find an expression for the rate at which the wage bill will be rising in 
t years’ time. What will this rate of rise be as a percentage of the wage bill 
at that time? 


The derivative $dy/dx$ is sometimes referred to as the first derivative.

The dot notation is attributed to Isaac Newton (1642-1727), so is
sometimes referred to as Newtonian notation.

It will be helpful later on, especially in integration, if you
can remember the derivatives of polynomials, exponentials,
natural logarithms and trigonometric functions.


Exercise 27 

Calculate the following derivatives. 
d 

(a) a where y = 1 — 9exp(—5z). 


(b) F’(2), where F(x) = 324 — 4x2 +1. 
2 


y 
(c) Tz’ where y = Int (t > 0). 
(d) g’(0), where g(t a = acos(3t) + bsin(3t) (and a and b are constants). 
(ce) F’(Z), where F(x) = 3sec(2x) — 4cos(—3z). 


We sometimes need to differentiate a complex-valued function of the
form

$$f(t) = g(t) + i\,h(t),$$

where $g$ and $h$ are real functions. Differentiation of such a function follows
the usual rule for constant multiples and sums. So

$$f'(t) = g'(t) + i\,h'(t).$$

For example, if $f(t) = \cos(3t) + i\sin(3t)$, then

$$f'(t) = -3\sin(3t) + 3i\cos(3t).$$


Exercise 28 


Find the second derivative $d^2f/dt^2$ of the function
$f(t) = \cos(2t) + i\sin(2t)$. Do this in two ways: first by differentiating the
real and imaginary parts, and then by using Euler’s formula to express this
function as a complex exponential, which you can differentiate using
equation (19). Check that you obtain the same result in both cases.


5.2 Differentiating combinations of functions 


Diagnostic test 


Try Exercises 31 and 32(b). If your answers agree with the solutions, 
you may proceed quickly to Subsection 5.3. 


The rule for differentiating constant multiples and sums is natural and 
easy to apply. Rules for products, quotients and compositions of functions 
are also very important. They are given below and in the Handbook, but 
you will be much better off if you remember them through practice of use. 
We begin with the rules for products and quotients. 

The product rule states that if $h(x) = f(x)\,g(x)$, then $h'(x) = f'(x)\,g(x) + f(x)\,g'(x)$.
The quotient rule states that if $w(x) = u(x)/v(x)$ with $v(x) \neq 0$, then
$w'(x) = \bigl(u'(x)\,v(x) - u(x)\,v'(x)\bigr)/v(x)^2$.


Example 3
(a) Find $h'(x)$, where $h(x) = x^3\cos(2x)$.
(b) Find $w'(x)$, where $w(x) = x^2/(3x^2 + 1)$.
Solution
(a) $h(x)$ is a product $f(x)\,g(x)$, with $f(x) = x^3$ and $g(x) = \cos(2x)$.
We have

$$f'(x) = 3x^2 \quad\text{and}\quad g'(x) = -2\sin(2x).$$

So the product rule gives

$$h'(x) = 3x^2\cos(2x) - 2x^3\sin(2x).$$

(b) $w(x)$ is a quotient $u(x)/v(x)$, with $u(x) = x^2$ and $v(x) = 3x^2 + 1$.
We have

$$u'(x) = 2x \quad\text{and}\quad v'(x) = 6x.$$

So the quotient rule gives

$$w'(x) = \frac{2x(3x^2 + 1) - x^2 \times 6x}{(3x^2 + 1)^2} = \frac{2x}{(3x^2 + 1)^2}.$$


Exercise 29 


d ] 
(a) Find a where y(xz) = at 
(b) Find f’(t), where f(t) = t? In(3t) and t > 0. 
(c) Find g’(0) (in terms of the constants A, B and C), where 
g(t) = (At + B) sin(Ct). 
(d) If the position of an object at time t is given by e~*" sin(4t), find its 
velocity and acceleration as functions of time. 


The rule for composite functions is a little more complicated to use. 


The composite rule

If $h$ is the composition of two functions $g$ and $u$, so that
$h(x) = g(u(x))$, then

$$h'(x) = g'(u(x))\,u'(x). \tag{69}$$

Expressed in Leibniz notation, this rule looks rather different: if
$u = u(x)$ is a function of $x$, and $h = h(u)$, then

$$\frac{dh}{dx} = \frac{dh}{du}\,\frac{du}{dx}. \tag{70}$$

In this form, the composite rule is often called the chain rule.


The essential point is that the derivative of a composite function is found
by differentiating the outer function and then multiplying by the derivative
of the inner function. For example, consider differentiating
$h(x) = \sin(x^3 + 2x)$. First, you should recognise that $h(x)$ is the
composition of two functions: $h(x) = g(u(x))$. The inner function is
$u(x) = x^3 + 2x$, and the outer function is $g(u) = \sin(u)$. To apply the
composite rule, we first find the derivatives of these: $u'(x) = 3x^2 + 2$ and
$g'(u) = \cos(u)$. Multiplying these then gives

$$h'(x) = g'(u(x)) \times u'(x) = \cos(x^3 + 2x) \times (3x^2 + 2).$$

Alternatively, we can use the chain rule by writing $u = x^3 + 2x$ so that
$h(u) = \sin(u)$. Then

$$\frac{dh}{dx} = \frac{dh}{du}\,\frac{du}{dx} = \cos(u) \times (3x^2 + 2) = \cos(x^3 + 2x) \times (3x^2 + 2).$$
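A derivative obtained from the composite rule can be sanity-checked against a numerical estimate of the gradient. The sketch below compares the formula above with a central difference at the arbitrarily chosen point x = 0.8; the helper `central_difference` and its step size are illustrative assumptions.

```python
import math

def h(x):
    """The composite function h(x) = sin(x^3 + 2x)."""
    return math.sin(x**3 + 2 * x)

def h_prime(x):
    """Derivative from the composite rule: cos(x^3 + 2x) * (3x^2 + 2)."""
    return math.cos(x**3 + 2 * x) * (3 * x**2 + 2)

def central_difference(f, x, step=1e-6):
    """Numerical estimate of f'(x) from points either side of x."""
    return (f(x + step) - f(x - step)) / (2 * step)

x0 = 0.8
numeric = central_difference(h, x0)
analytic = h_prime(x0)
```

Close agreement between `numeric` and `analytic` does not prove the formula, but a clear disagreement would usually indicate a slip such as forgetting the factor from the inner function.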
Example 4

Find $df/dx$, where $f(x) = \sin^3 x = (\sin x)^3$.

Solution
If we let $u = \sin x$, then we have $f(x) = [u(x)]^3$.
The chain rule then gives

$$\frac{df}{dx} = \frac{df}{du}\,\frac{du}{dx} = 3u^2\cos x = 3\sin^2 x\,\cos x,$$

where we have replaced the variable $u$ by $\sin x$ in the last step.

(The recognition of $\sin^3 x$ as a composite function, and of how
to break it down into two parts, each consisting of a standard
function, is the key to differentiating it.)


Exercise 30 

Use the composite rule (or chain rule) to differentiate each of the following. 
(a) y(t) = exp(t?) (b) f(x) = (3x? + 4)® (c) z(v) = tan(3v + 4) 
(4) 9(z)=VE=2 —(e) Fe) =1/(VTF2@)* 


Exercise 31 


Differentiate the following functions. These differentiations involve 


more than one rule. 
= ee _ 42 3 
(a) y =sec 4] (b) z= t*exp(t? + 1) 
x 


Implicit differentiation 


Suppose that we want to find the gradient at the point $(x, y) = (2, 1)$ of
the tangent to the ellipse with equation

$$x^2 + 4y^2 = 8. \tag{71}$$

We want $dy/dx$ at $x = 2$ and $y = 1$. We could start by expressing $y$ as a
function of $x$, but a more convenient approach is to differentiate the
equation as it stands. To differentiate $y^2$ with respect to $x$, we use the
composite rule, and obtain

$$\frac{d(y^2)}{dx} = \frac{d(y^2)}{dy}\,\frac{dy}{dx} = 2y\frac{dy}{dx}.$$

So, differentiating both sides of equation (71) with respect to $x$, we obtain

$$2x + 4 \times 2y\frac{dy}{dx} = 0.$$

When $x = 2$ and $y = 1$, this gives $4 + 8\,dy/dx = 0$, so $dy/dx = -\tfrac{1}{2}$.
Therefore the tangent to this ellipse at $(2, 1)$ has gradient $-\tfrac{1}{2}$.

Differentiation with respect to $x$ of an expression such as $x^2 + 4y^2$, where $y$
is a function of $x$, is known as implicit differentiation.
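The result of the implicit differentiation can be compared with the more laborious approach of first expressing y as a function of x. The sketch below is an illustration only: it solves equation (71) for the upper half of the ellipse (where y > 0, which includes the point (2, 1)) and estimates the gradient numerically; the function names are hypothetical.

```python
import math

def dydx_implicit(x, y):
    """Solve 2x + 8y dy/dx = 0 for dy/dx (valid when y is non-zero)."""
    return -2 * x / (8 * y)

def upper_branch(x):
    """Upper half of the ellipse x^2 + 4y^2 = 8, solved explicitly for y."""
    return math.sqrt((8 - x**2) / 4)

# Central-difference estimate of the gradient of the explicit branch at x = 2.
step = 1e-6
numeric = (upper_branch(2 + step) - upper_branch(2 - step)) / (2 * step)

# Gradient from implicit differentiation at the point (2, 1).
implicit = dydx_implicit(2, 1)
```

Both approaches give a gradient of $-\tfrac{1}{2}$ at (2, 1), but the implicit route avoids choosing a branch of the square root.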


(Example 5 in Subsection 5.3 illustrates ways of answering the questions
that are listed there for sketching the graph of a function.)


Exercise 32 


(a) Use the product and composite rules to find the following in terms of 
x, y and dy/dz. 


(@) Sy) Gi) HO) Gi) Sle + sin(ay)) 


(b) Find the gradient at the point (—1,1) of the tangent to the curve 
etatyty? =F le 


5.3 Investigating functions 


Diagnostic test 


Try Exercise 33. If your answer agrees with the solution, you may 
proceed quickly to Section 6. 


Faced with an expression made up of some combination of standard 
functions, how might you investigate its behaviour? As an example, 
consider the function 
= v 
P=q + 1.5v + 0.00802’ 


where we would like to see how f varies as v varies. 


A sketch graph of f against v helps with this, and a computer package or 
graphics calculator will provide such a graph. However, it is not always 
obvious for what range of values to plot the graph, so it is helpful to be 
able to deduce some information about the general behaviour of a function 
‘by hand’, without recourse to a machine. Such information can also be 
used to cross-check results obtained from a machine, and to flesh out the 
picture more fully. This example will be continued in Exercise 34, but first 
consider some general remarks about sketching graphs. 


Questions that you might consider before sketching the graph of a function 
f(x) include the following: 


1. Are there any points where f(x) is not defined? 
2. Where does the graph of f(x) cross the axes?


3. How does f(x) behave for large and small values of x (or at the 
endpoints of its domain if this is finite)? 


4. Are there any stationary points of f(x)? If so, are there any local 
maximum or minimum values? 


5. Where does the graph of f(x) have a positive gradient, and where 
does it have a negative gradient? 


The last two questions can be answered using differentiation. 


Definition 


A stationary point of a function $f(x)$ is a value of $x$ where

$$\frac{df}{dx} = 0.$$


Local maxima and local minima occur at stationary points, although a 
stationary point need not necessarily be either. Figure 27 illustrates 
stationary points of various kinds. 


Figure 27 Stationary points of a hypothetical function $f(x)$, including a local
maximum and a point where the gradient is 0 but which is neither a maximum
nor a minimum

There is a local maximum at $x_0$ if $f(x_0) > f(x)$ for all points $x$ in the
immediate vicinity of $x_0$, and there is a local minimum at $x_0$ if
$f(x_0) < f(x)$ for all points $x$ in the immediate vicinity of $x_0$. Figure 27
also shows a stationary point that is neither a local maximum nor a local
minimum: such a point is called a point of inflection.

To classify a given stationary point of $f(x)$, we can use a test based on the
second derivative, $f''(x)$.


Classifying stationary points using second derivatives
A stationary point $x_0$ of a function $f(x)$ is:

• a local maximum if $f''(x_0) < 0$

• a local minimum if $f''(x_0) > 0$

• a point of inflection if $f''(x_0) = 0$ and $f''(x)$ changes sign as $x$
increases through $x_0$.


An alternative strategy is sometimes preferred; it is useful in cases where 
evaluation of the second derivative is messy. 


(Note that the condition $f''(x_0) = 0$ is insufficient by
itself to determine the nature of the stationary point.)

Classifying stationary points using first derivatives
A stationary point $x_0$ of a function $f(x)$ is:

• a local maximum if, for all $x$ in the immediate vicinity of $x_0$, we
have $f'(x) > 0$ for $x < x_0$ and $f'(x) < 0$ for $x > x_0$

• a local minimum if, for all $x$ in the immediate vicinity of $x_0$, we
have $f'(x) < 0$ for $x < x_0$ and $f'(x) > 0$ for $x > x_0$.


We often wish to find the overall maximum or overall minimum of some
function, usually referred to as the global maximum or global
minimum, respectively. The global minimum or global maximum of a
function may well occur at a stationary point; but caution is needed, for it
need not necessarily do so. For example, if the function $f(x)$ in Figure 27
is defined in the domain $0 \le x < \infty$, then its global minimum occurs at the
endpoint $x = 0$, which is not a stationary point. In fact, a function need
not have a global maximum or global minimum. For example, the function
in Figure 27 exceeds the local maximum value when $x$ is large, but it never
reaches a global maximum. Notice that the value of $f(x)$ does not grow
without limit. Instead, it gets arbitrarily close to the value $A$ as $x$
increases, but it always remains smaller than $A$, and never actually reaches
this limiting value. The line $y = A$ is called an asymptote of the graph
of $f(x)$. Such behaviour is sometimes indicated by writing $f \to A$ as
$x \to \infty$, which is read as ‘f tends to the value A as x tends to infinity’.


Exercise 33 
(a) Find any stationary points of the function 
y(x) =5—22Aax+1)e*/* («> 0). 


(b) Classify these as local minima or local maxima or neither, and 
evaluate y(x) at these points. 


Example 5
Suppose that

$$(x^2 - 3)\,y = x - 2.$$

Sketch a graph of $y$ against $x$.
Solution
We address some of the questions listed at the beginning of this subsection.
(1) Points where y is not defined. We have

$$y = \frac{x - 2}{x^2 - 3}.$$

This is not defined when $x^2 - 3 = 0$, i.e. at $x = \pm\sqrt{3}$.

(2) Points where y(x) crosses the axes. We can see that $y = 0$ if (and
only if) $x = 2$, so the graph crosses the x-axis at this one point.

(3) Behaviour for large values of x. If $x$ is large (positive or negative), then
$y$ will be close to zero.

(4) Stationary points and their classification. To look for stationary points,
we use the quotient rule to calculate

$$\frac{dy}{dx} = \frac{(1)(x^2 - 3) - (x - 2)(2x)}{(x^2 - 3)^2} = \frac{-x^2 + 4x - 3}{(x^2 - 3)^2} = -\frac{(x - 1)(x - 3)}{(x^2 - 3)^2}.$$

This is zero if $(x - 1)(x - 3) = 0$. So the function has stationary points at
$x = 1$ and $x = 3$.

In this case, the second derivative is a bit messy to calculate, and it is
easier to look at the sign of the first derivative near $x = 1$ and $x = 3$ to
check whether these stationary points are local maxima or minima. If $x$ is
just less than 1, then $dy/dx$ is negative, while if $x$ is just greater than 1,
then $dy/dx$ is positive. Hence $x = 1$ is a local minimum. For $x$ just
below 3, $dy/dx$ is positive, while for $x$ just above 3, it is negative, so $x = 3$
is a local maximum. The values of the function at these points are:
$y = 1/2$ at $x = 1$, and $y = 1/6$ at $x = 3$.


(5) Regions of positive and negative gradient. Just below the local
minimum, and just above the local maximum, the function has a negative
gradient. Just above the local minimum, and just below the local
maximum, it has a positive gradient. Because there are no other local
minima or maxima, we can make further deductions: for example, the
gradient remains positive everywhere between the local minimum at $x = 1$
and the point $x = \sqrt{3}$, where the function is not defined.


Figure 28 A graph of $y = (x - 2)/(x^2 - 3)$

(To check the signs of $dy/dx$ near the stationary points, try $x = 0.9$ and
$x = 1.1$, and then $x = 2.9$ and $x = 3.1$.)

Incorporating all this information (plus other information, such as the
behaviour of $y$ near $x = \pm\sqrt{3}$), we can produce a sketch graph of the
function. Figure 28 exhibits all the main features.
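The first-derivative test used in Example 5 can be expressed as a short routine. The sketch below is illustrative: the function `classify` and the offset `delta = 0.1` (matching the suggested trial values 0.9, 1.1, 2.9 and 3.1) are assumptions, and the offset must be small enough that no other stationary point or undefined point lies within it.

```python
def dydx(x):
    """First derivative of y = (x - 2)/(x^2 - 3), from the quotient rule."""
    return -(x - 1) * (x - 3) / (x**2 - 3) ** 2

def classify(fprime, x0, delta=0.1):
    """Classify a stationary point x0 by the sign of f' just either side of it."""
    left, right = fprime(x0 - delta), fprime(x0 + delta)
    if left > 0 and right < 0:
        return "local maximum"
    if left < 0 and right > 0:
        return "local minimum"
    return "neither"

# Stationary points found in Example 5.
results = {x0: classify(dydx, x0) for x0 in (1, 3)}
```

The routine reports a local minimum at x = 1 and a local maximum at x = 3, in agreement with the working above.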


Exercise 34 


Consider the function 


f= Tam cone? 9) 

(a) (i) Find any values of v for which f(v) is not defined.
(ii) Find any values of v for which f(v) is zero.
(iii) Indicate how f(v) behaves as v becomes large.
(iv) Find any local maxima or minima of f(v), and evaluate f(v) at
these points.


(b) Sketch a graph of f(v), and use it to deduce the global maximum and 
global minimum of the function. 


6 Integration 


Integration arises in two different contexts. First, it ‘reverses the process of 
differentiation’. Subsection 6.1 provides a reminder of this basic idea, and 
Subsection 6.2 discusses how to find relatively simple integrals, referring to 
the Handbook for standard results if necessary. Subsection 6.3 looks at two 
special techniques for finding more complicated integrals by hand. 


Integration also arises as a kind of summation. For example, the mass of 
an object can be expressed as the integral of a function that describes how 
the object’s density varies from point to point. Such definite integrals are 
discussed in Subsection 6.4. 


6.1 Reversing differentiation 


Diagnostic test 


Try Exercise 35. If your answer agrees with the solution, you may 
proceed quickly to Subsection 6.2. 


In the rest of this book, and elsewhere in the module, you will meet a 
variety of differential equations. These are equations involving the 
derivative of a function. 


For example, if $x$ is the position of a particle and $t$ is time, then the
velocity ($v = dx/dt$) might be given by the equation

$$\frac{dx}{dt} = 5t + 7. \tag{72}$$

The objective is usually to ‘solve’ the equation, so in this case we want to
find the position $x$ as a function of time $t$. To do this involves ‘reversing’
the differentiation, and this process is referred to as integration. In the
above example we integrate both sides of the equation with respect to $t$,
obtaining

$$x = \int (5t + 7)\,dt. \tag{73}$$

To evaluate this integral, you can use a table of standard integrals in the
Handbook. The result is

$$x = \tfrac{5}{2}t^2 + 7t + C, \tag{74}$$

where $C$ can be any constant. To confirm that equation (74) really is the
solution of equation (72), we can differentiate it, obtaining

$$\frac{dx}{dt} = \frac{d}{dt}\left(\tfrac{5}{2}t^2 + 7t + C\right) = 5t + 7,$$

as required. The constant $C$ is often referred to as an arbitrary constant
or a constant of integration. Its presence means that the differential
equation (72) does not have a unique solution.

(Note that the arbitrary constant is included once the expression
has been integrated, as in equation (74).)
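The check that differentiating the solution recovers the differential equation can also be carried out numerically, for several values of the arbitrary constant. The sketch below assumes that equation (72) reads dx/dt = 5t + 7 and that equation (74) is the corresponding integral; the function names and sample values are illustrative.

```python
def x_solution(t, C):
    """Candidate solution x = (5/2) t^2 + 7t + C."""
    return 2.5 * t**2 + 7 * t + C

def velocity(t):
    """Right-hand side of the differential equation, 5t + 7."""
    return 5 * t + 7

def central_difference(f, t, step=1e-5):
    """Numerical estimate of df/dt at time t."""
    return (f(t + step) - f(t - step)) / (2 * step)

# Differentiate the candidate solution for several constants C and times t.
checks = []
for C in (0.0, -3.0, 10.0):
    for t in (0.0, 1.5, 4.0):
        slope = central_difference(lambda s: x_solution(s, C), t)
        checks.append(abs(slope - velocity(t)) < 1e-6)
```

Every entry of `checks` is true whatever value of C is used, which illustrates why the differential equation on its own does not determine a unique solution.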


More generally, suppose that f(x) is a known function, and F(x) is an
unknown function satisfying the differential equation

\[ F'(x) = f(x). \]

Then we write the solution of this equation as

\[ F(x) = \int f(x)\, dx, \]

where the right-hand side, $\int f(x)\, dx$, is called the indefinite integral of
f(x), and the function to be integrated, f(x), is called the integrand.
For example, the differential equation

\[ F'(x) = \frac{1}{1 + x^2} \tag{75} \]

has solution

\[ F(x) = \int \frac{1}{1 + x^2}\, dx. \]

We now need to find the integral. In this case we are lucky, since the table
of standard derivatives in the Handbook gives

\[ \frac{d}{dx}(\arctan x) = \frac{1}{1 + x^2}. \]




Unit 1 Getting started 


Any function that differentiates to $1/(1 + x^2)$, such as
$F(x) = \arctan x + 5$, is referred to as an integral (or
antiderivative) of $1/(1 + x^2)$.




Hence we have 


\[ \int \frac{1}{1 + x^2}\, dx = \arctan x + C, \]


where C is an arbitrary constant. 


By contrast, consider finding the integral

\[ \int \exp(-x^2)\, dx. \]

At first sight, this might seem no harder a problem to solve than
equation (75). In fact, however, it is impossible! To be more precise, there
is no simple combination of the elementary functions (polynomials, sin,
cos, exp and ln) that when differentiated gives $\exp(-x^2)$.
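Although no elementary expression exists for this indefinite integral, a definite integral of $\exp(-x^2)$ can still be approximated numerically. The sketch below is an illustration only: it uses composite Simpson's rule, which is not a method discussed in this unit, and compares the result with the error function `math.erf` from Python's standard library, relying on the standard identity $\int_0^1 \exp(-x^2)\, dx = \tfrac{\sqrt{\pi}}{2}\,\mathrm{erf}(1)$. The helper name `simpson` is an assumption made for this example.

```python
import math


def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with n subintervals (n must be even)."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        # Interior points alternate between weight 4 (odd i) and weight 2 (even i).
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


approx = simpson(lambda x: math.exp(-x * x), 0.0, 1.0)
reference = math.sqrt(math.pi) / 2 * math.erf(1.0)
print(f"Simpson estimate: {approx:.10f}")
print(f"erf-based value:  {reference:.10f}")
```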


Finding explicit expressions for integrals is a much harder task than 
finding derivatives. The rules of differentiation ensure that we can, in 
principle, find an explicit expression for the derivative of any combination 
of elementary functions. The equivalent is not true for integrals. What is 
more, even when integrals can be found, the working needed can be messy. 
The art of integration often relies on recognising patterns and knowing 
what works in particular cases. 


There is a table of standard integrals in the Handbook. An integral will be
readily found if it fits into one of the patterns listed there (such as $\int e^{ax}\, dx$
or $\int x^a\, dx$ for $a \neq -1$). Note that the table in the Handbook does not
include arbitrary constants; you must supply these yourself. There are
also simple rules for integrating constant multiples and sums. More 
complicated integrals can often be found using two techniques that will be 
introduced later: integration by substitution and integration by parts. 


The table of standard integrals in the Handbook contains quite a wide 
selection of integrals. Some of these integrals are deduced from the table of 
standard derivatives, others by using integration by parts or substitution. 
You can regard them all as the fruit of others’ experience, and draw on 
them as needed. The correctness of an integral obtained by using the 
Handbook can always be verified by differentiation. 


Exercise 35 


Use differentiation to verify that the following integrals are correct (where
$a \neq 0$ is a constant and C is an arbitrary constant).

(a) $\int x \sin(ax)\, dx = -\frac{x}{a} \cos(ax) + \frac{1}{a^2} \sin(ax) + C$

(b) $\int \tan(ax)\, dx = -\frac{1}{a} \ln(\cos(ax)) + C \quad \left(-\frac{\pi}{2} < ax < \frac{\pi}{2}\right)$




6.2 Evaluating integrals 


Diagnostic test 


Try Exercises 37(a), 37(b) and 39. If your answers agree with the 
solutions, you may proceed quickly to Subsection 6.3. 


Your first recourse for finding an integral by hand is the table of standard
integrals in the Handbook. (Or, preferably, use your memory!) If the integrals
of functions f and g are known, then the integral of af + bg, where a and b
are constants, is readily found, using the rule

\[ \int \big(a f(x) + b g(x)\big)\, dx = a \int f(x)\, dx + b \int g(x)\, dx. \tag{76} \]


So, for example (referring to the Handbook for $\int e^{ax}\, dx$ and $\int x^a\, dx$),

\[ \int (4e^{2x} + 9x^7)\, dx = 4 \int e^{2x}\, dx + 9 \int x^7\, dx = 4\left(\tfrac12 e^{2x}\right) + 9\left(\tfrac18 x^8\right) + C = 2e^{2x} + \tfrac98 x^8 + C. \]

Each of the integrals introduces an arbitrary constant, but the sum of these
is also an arbitrary constant, so we need only one.

Sometimes algebraic manipulation can transform an expression to be
integrated into a more amenable form. For example, the manipulation

\[ \frac{3x^2 + 2x}{\sqrt{x}} = \frac{3x^2}{\sqrt{x}} + \frac{2x}{\sqrt{x}} = 3x^{3/2} + 2x^{1/2} \]

transforms the expression on the left into a sum of constant multiples of
integrals tabulated in the Handbook. Less obvious transformations can be
achieved using trigonometric formulas. For example, equation (42) can be
rearranged to give

\[ \cos^2 x = \tfrac12 (1 + \cos 2x), \]

which enables us to integrate $\cos^2 x$.


Exercise 36 
Use the identity 


\[ \cos^2(ax) = \tfrac12 [1 + \cos(2ax)] \]

(where $a \neq 0$ is a constant) to obtain $\int \cos^2(ax)\, dx$.


At times, attention needs to be paid to domains, to avoid giving, as
integrals, expressions that are not defined. For example, if $x > 0$, then
there is no difficulty in writing

\[ \int \frac{1}{x}\, dx = \ln x + C \quad (x > 0), \tag{77} \]

but the right-hand side of this expression makes no sense if $x < 0$ because
$\ln x$ is not defined in this case. (The domain of $\ln x$ is $x > 0$.) For $x < 0$ we
have instead

\[ \int \frac{1}{x}\, dx = \ln(-x) + C \quad (x < 0). \tag{78} \]

Equations (77) and (78) are both valid, because

\[ \frac{d}{dx}(\ln x) = \frac{1}{x} \quad \text{and} \quad \frac{d}{dx}(\ln(-x)) = \frac{1}{-x} \times (-1) = \frac{1}{x}. \]

The results in equations (77) and (78) are sometimes combined in the
formula

\[ \int \frac{1}{x}\, dx = \ln|x| + C \quad (x \neq 0), \]

where $|x|$ is the modulus of x, equal to $+x$ if $x \geq 0$, and to $-x$ if $x < 0$.
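The domain issue shows up directly in numerical work. The sketch below is an illustration under stated assumptions (the names `log_abs` and `central_difference` are invented for this example): Python's `math.log` rejects a negative argument, whereas the logarithm of the modulus has slope 1/x on both sides of the origin.

```python
import math


def log_abs(x):
    """ln|x|, defined for every x != 0."""
    return math.log(abs(x))


def central_difference(func, x, h=1e-6):
    """Approximate the derivative of func at x by a symmetric difference quotient."""
    return (func(x + h) - func(x - h)) / (2 * h)


# The real logarithm is not defined for negative arguments.
try:
    math.log(-2.0)
except ValueError as err:
    print("math.log(-2.0) fails:", err)

# The slope of ln|x| agrees with 1/x for negative and positive x alike.
for x in (-2.0, -0.5, 0.5, 2.0):
    slope = central_difference(log_abs, x)
    print(f"x = {x:5.2f}: d/dx ln|x| ~ {slope:.6f}, 1/x = {1 / x:.6f}")
```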


Exercise 37 


Find the following integrals. Use the standard integrals given in the
Handbook as necessary.


(a) [era (b) [ ssec2(at) dt (c) /aome 
@) fw (<3) © [4 (y > 3) 


Exercise 38 


Find the following integrals. Use the standard integrals given in the
Handbook as necessary.


; 1 
(a) [(6cos(—24) + 8sin(4t)) dt (b) l= 
3 
(c) /* ae (t<0) (a) [ (mao 7 =) dt (t>0) 


1 
(e) lames" ene i 


dt (-3<t<3) 


The next example again uses a standard integral from the Handbook, but 
requires careful matching of parameters and attention to domains. 


Example 6 


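For $x > \frac{1}{A} > 0$, find $I = \int \frac{1}{x(1 - Ax)}\, dx$.

Solution

The Handbook contains the standard integral

\[ \int \frac{1}{(x - a)(x - b)}\, dx = \frac{1}{a - b} \ln\left|\frac{x - a}{x - b}\right| + C. \]

The given integral I can be expressed in the form

\[ I = \int \frac{-1}{A(x - 0)\left(x - \frac{1}{A}\right)}\, dx, \]

which matches the Handbook integral with a = 1/A and b = 0.

Hence

\[ I = -\frac{1}{A} \int \frac{1}{(x - 0)\left(x - \frac{1}{A}\right)}\, dx = -\frac{1}{A} \times \frac{1}{1/A} \ln\left|\frac{x - \frac{1}{A}}{x}\right| + C = -\ln\left|\frac{x - \frac{1}{A}}{x}\right| + C. \]

The modulus signs can be removed because x > 1/A and x > 0. Hence

\[ I = \ln\left(\frac{x}{x - \frac{1}{A}}\right) + C = \ln\left(\frac{Ax}{Ax - 1}\right) + C. \]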


Exercise 39 


For $k > 0$ and $-k < v < k$, find $\int \frac{1}{v^2 - k^2}\, dv$.

(Hint: Remember that $v^2 - k^2 = (v - k)(v + k)$.)


6.3 Integration by substitution and by parts 


Diagnostic test 


Try Exercises 40(a), 40(b) and 42(a). If your answers agree with the 
solutions, you may proceed quickly to Subsection 6.4. 


This subsection looks at two useful methods for finding more complicated 
integrals. Deciding which method to use in any particular case comes 
through experience. 

Integration by substitution 


Many integrals are of the form

\[ \int f(u) \frac{du}{dx}\, dx, \]

or are constant multiples of such an expression. For example, the integral

\[ \int \cos(2 + 3x^2) \times 6x\, dx \tag{79} \]

is of this form, with

\[ u(x) = 2 + 3x^2, \quad f(u) = \cos(u) \quad \text{and} \quad \frac{du}{dx} = 6x. \]




In this case, $f(u(x)) = \cos(2 + 3x^2)$ is the composition of an ‘inner
function’ $u(x) = 2 + 3x^2$ and an ‘outer function’ $f(u) = \cos(u)$. But
$f(u(x))$ is only part of the integrand because it is multiplied by the
derivative $du/dx = 6x$ of the inner function, $u(x) = 2 + 3x^2$.

Integrals like this can be found using the method of integration by
substitution, based on the following formula. (This method is also referred to
as integration by change of variable.)

Formula for integration by substitution

\[ \int f(u(x))\, u'(x)\, dx = \int f(u)\, du \tag{80} \]

or, in Leibniz notation,

\[ \int f(u) \frac{du}{dx}\, dx = \int f(u)\, du. \tag{81} \]

In equation (81), the expression $\frac{du}{dx}\, dx$ in the integral on the left is
effectively replaced by $du$ in the integral on the right. This is worth
remembering, and will be used later on.


Example 7 


Find $\int 6x \cos(2 + 3x^2)\, dx$.

Solution

We try the substitution $u(x) = 2 + 3x^2$. Then $f(u) = \cos(u)$ and
$du/dx = 6x$. So

\[ \int \cos(2 + 3x^2)\, 6x\, dx = \int \cos(u) \frac{du}{dx}\, dx = \int \cos(u)\, du. \]

The integral over u is now easy. It is $\sin(u) + C$, so we have

\[ \int \cos(2 + 3x^2)\, 6x\, dx = \sin(2 + 3x^2) + C, \]

where we have substituted for u in terms of x.

The answer can be checked by differentiation:

\[ \frac{d}{dx}\left(\sin(2 + 3x^2) + C\right) = \cos(2 + 3x^2) \times 6x. \]

The rule for integration by substitution can be thought of as the reverse of
the rule for differentiating a composite function.


Sometimes a rearrangement is needed to coax the integrand into a suitable
form. For example, given the integral $\int x \cos(2 + 3x^2)\, dx$, we need to
recognise that this is essentially the same as the integral that we have just
considered, but with 6x replaced by x. Such a constant multiple is easily
dealt with by writing

\[ \int x \cos(2 + 3x^2)\, dx = \tfrac16 \int \cos(2 + 3x^2) \times 6x\, dx. \]

We then proceed just as before to obtain $\tfrac16 \sin(2 + 3x^2) + C$.
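The substitution formula can be illustrated numerically by comparing an integral in x with the corresponding integral in u. The sketch below is an illustration only: it uses definite integrals (treated in Subsection 6.4) over the arbitrarily chosen range 0 ≤ x ≤ 1, so that u = 2 + 3x² runs from 2 to 5, and a simple midpoint rule whose name `midpoint_rule` is an assumption of this example.

```python
import math


def midpoint_rule(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] using n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


# Integral in the original variable x, including the factor du/dx = 6x.
in_x = midpoint_rule(lambda x: math.cos(2 + 3 * x**2) * 6 * x, 0.0, 1.0)

# Integral in the new variable u = 2 + 3x^2; x = 0 and x = 1 give u = 2 and u = 5.
in_u = midpoint_rule(math.cos, 2.0, 5.0)

# Value predicted by the antiderivative sin(u).
exact = math.sin(5.0) - math.sin(2.0)

print(f"integral in x: {in_x:.8f}")
print(f"integral in u: {in_u:.8f}")
print(f"sin 5 - sin 2: {exact:.8f}")
```

Dividing any of these values by 6 gives the corresponding result for the integrand x cos(2 + 3x²).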


The key to using this method is recognising when the integrand has a 
suitable form: be on the lookout for an inner function u(x) whose 
derivative du/dx appears as a multiplicative factor in the integrand. 


One form of integral comes up sufficiently often to be worth special
mention. If $g(x) \neq 0$, integration by substitution gives

\[ \int \frac{g'(x)}{g(x)}\, dx = \ln|g(x)| + C. \tag{82} \]

To see why this is so, make the substitution $u = g(x)$. Then
$du/dx = g'(x)$, so

\[ \int \frac{g'(x)}{g(x)}\, dx = \int \frac{1}{u} \frac{du}{dx}\, dx = \int \frac{1}{u}\, du. \]

Finding the integral of 1/u requires some care with domains (see the text
following Exercise 36). For $u > 0$ it is $\ln(u) + C$, and for $u < 0$ it is
$\ln(-u) + C$. These cases correspond to $g(x) > 0$ and $g(x) < 0$, respectively.
Both cases are included by using the modulus sign in equation (82).


Exercise 40 

Find the following integrals by making suitable substitutions. One advantage of knowing your 
derivatives is that you can 

(a) fv exp(2 + Ay?) dy (b) [os y sin? y dy adjust your tactics, choosing 
u(x) so that u’(a) appears where 

r needed in the integrand. 
(c) [i-eat (-l<t<1) (d) f/m 
x 
© [| dt ) [a Corer dhe identity in@e) = Sein Feast 
e ——_ — =e e identity sin(2t) = 2sintcos 
1+sin?¢ i-f ° y 


may be useful in part (e). 


All the integrals considered in Exercise 40 have integrands that can be
expressed as a constant times $f(u(x))\, u'(x)$ for some choice of $u(x)$. We are
not always so fortunate. The integral in the following example is not of
this pattern, but we can still use a substitution to help find it.


Example 8 


Find the integral $I = \int \frac{x^2}{\sqrt{2x - 1}}\, dx$ for $x > 1/2$.

Solution

We try the substitution $u = 2x - 1$. Then $du/dx = 2$ and the denominator
in the integrand becomes $\sqrt{u} = u^{1/2}$. We still need to express the
numerator $x^2$ in terms of u, and the element of integration $dx$ in terms
of $du$. The first task is easily achieved. Rearranging our equation for u, we
get $x = (u + 1)/2$, so $x^2 = (u + 1)^2/4$.

To express $dx$ in terms of $du$, we can use the fact noted earlier that within
the integral in equation (81), we can effectively make the replacement
$(du/dx)\, dx = du$. Since $du/dx = 2$, we have $2\, dx = du$, so we can make the
replacement $dx = \tfrac12\, du$.

Putting all this together, we get

\[ I = \int \frac{x^2}{\sqrt{2x - 1}}\, dx = \int \frac{(u + 1)^2/4}{u^{1/2}} \times \tfrac12\, du. \]

Now we are in business! A straightforward calculation gives

\[ I = \tfrac18 \int \frac{u^2 + 2u + 1}{u^{1/2}}\, du \]
\[ = \tfrac18 \int \left(u^{3/2} + 2u^{1/2} + u^{-1/2}\right) du \]
\[ = \tfrac18 \left(\tfrac25 u^{5/2} + \tfrac43 u^{3/2} + 2u^{1/2}\right) + C \]
\[ = \tfrac{1}{20}(2x - 1)^{5/2} + \tfrac16 (2x - 1)^{3/2} + \tfrac14 (2x - 1)^{1/2} + C. \]
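As with any integral, the result of Example 8 can be checked by differentiation. The following sketch does this numerically, assuming the integrand x²/√(2x − 1) and the antiderivative (1/20)(2x − 1)^{5/2} + (1/6)(2x − 1)^{3/2} + (1/4)(2x − 1)^{1/2}; the function names are illustrative, and the sample points are arbitrary values with x > 1/2.

```python
import math


def integrand(x):
    """x^2 / sqrt(2x - 1), defined for x > 1/2."""
    return x**2 / math.sqrt(2 * x - 1)


def antiderivative(x, C=0.0):
    """Candidate integral, written in terms of u = 2x - 1."""
    u = 2 * x - 1
    return u**2.5 / 20 + u**1.5 / 6 + math.sqrt(u) / 4 + C


def central_difference(func, x, h=1e-6):
    """Approximate the derivative of func at x by a symmetric difference quotient."""
    return (func(x + h) - func(x - h)) / (2 * h)


for x in (0.75, 1.0, 2.0, 5.0):
    slope = central_difference(antiderivative, x)
    print(f"x = {x:4.2f}: derivative ~ {slope:.6f}, integrand = {integrand(x):.6f}")
```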
Exercise 41 


Ox? +1 
3874 +1 


Use the substitution u = 3x + 1 to find / dx for « > —1/3. 


Integration by parts 


The method of integration by parts is based on the following formula. 


Formula for integration by parts 


\[ \int f(x)\, g'(x)\, dx = f(x)\, g(x) - \int f'(x)\, g(x)\, dx, \tag{83} \]

or, in Leibniz notation,

\[ \int f(x) \frac{dg}{dx}\, dx = f(x)\, g(x) - \int \frac{df}{dx}\, g(x)\, dx. \tag{84} \]

Note that an arbitrary constant need not be included in the expression for
g(x) here, and it is usually omitted.


As with integration by substitution, this formula transforms an integral 
into a different one, and the key to success is to ensure that the ‘new’ 
integral is easier to evaluate than the original. 


Example 9 
Find $\int x e^{-2x}\, dx$.

Solution

Take $f(x) = x$ and $g'(x) = e^{-2x}$. Then $f'(x) = 1$ and $g(x) = -\tfrac12 e^{-2x}$, so

\[ \int x e^{-2x}\, dx = x \left(-\tfrac12 e^{-2x}\right) - \int 1 \times \left(-\tfrac12 e^{-2x}\right) dx \]
\[ = -\tfrac12 x e^{-2x} + \tfrac12 \int e^{-2x}\, dx \]
\[ = -\tfrac12 x e^{-2x} - \tfrac14 e^{-2x} + C \]
\[ = -\tfrac14 (2x + 1) e^{-2x} + C. \]

Our motive for splitting the integrand into $x \times e^{-2x}$ in this way is that x
becomes simpler when it is differentiated, while $e^{-2x}$ at least gets no more
complicated when it is integrated. So we end up with an integral that is
easier to perform.
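A numerical cross-check of Example 9 is sketched below. It is an illustration only, anticipating the definite integrals of Subsection 6.4: the range 0 ≤ x ≤ 1 is an arbitrary choice, and the names `midpoint_rule` and `antiderivative` are assumptions of this example.

```python
import math


def midpoint_rule(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] using n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def antiderivative(x):
    """Result of integrating x * exp(-2x) by parts, with the constant omitted."""
    return -0.25 * (2 * x + 1) * math.exp(-2 * x)


numeric = midpoint_rule(lambda x: x * math.exp(-2 * x), 0.0, 1.0)
from_parts = antiderivative(1.0) - antiderivative(0.0)

print(f"midpoint rule:        {numeric:.8f}")
print(f"integration by parts: {from_parts:.8f}")
```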


Exercise 42 


(a) Use integration by parts to find fee dx. 
(b) Use integration by parts and the result of part (a) to find jae dx. 
(c) Use integration by parts to find / e” sina dz. You will need to 


integrate by parts twice, and compare your answer with the original 
integral. 


6.4 Definite integrals 


Diagnostic test 


Try Exercises 43, 45 and 46. If your answers agree with the solutions, 
you may proceed quickly through this subsection. 


An indefinite integral is a function (or, to be exact, a family of functions 
containing an arbitrary constant). A different, though closely related, 
integral is the definite integral, whose value is a number. 


If F(x) is the integral of f(x), so that

\[ \int f(x)\, dx = F(x), \]

then the definite integral of f(x) between x = a and x = b is written as
$\int_a^b f(x)\, dx$, and is defined by

\[ \int_a^b f(x)\, dx = F(b) - F(a). \tag{85} \]




The value x = a is called the lower limit of integration, and the value
x = b is called the upper limit of integration. A particular expression
for F(x) will contain an arbitrary constant, but this does not affect the
value of the definite integral because the constant cancels out in
equation (85). The difference F(b) − F(a) is commonly written as $[F(x)]_a^b$
or $F(x)\big|_a^b$. So, for example,

\[ \int_0^1 \frac{1}{\sqrt{4 - x^2}}\, dx = \left[\arcsin\left(\tfrac{x}{2}\right)\right]_0^1 = \arcsin\tfrac12 - \arcsin 0 = \frac{\pi}{6} - 0 = \frac{\pi}{6}. \]

A useful way of thinking of a definite integral $\int_a^b f(x)\, dx$ is as an
accumulation of small quantities as x varies from a to b. For example,
suppose that a straight rod lies along the x-axis and has a density per unit
length given by the function f(x), which varies along the length of the rod.
The mass of a small segment of this rod, of length $\delta x$, centred on x, is
approximated by $f(x)\, \delta x$. Some approximation is involved because the
density varies inside the segment, but this variation is very slight if the
segment is short enough. If we add up the masses of all the segments, we
obtain the total mass of the rod. We can do this in the limit where the
length of each segment approaches zero and the number of segments
becomes huge. This improves the approximation mentioned above, and in
the limit gives the exact total mass of the rod as the definite integral

\[ M = \int_a^b f(x)\, dx, \]

where x = a and x = b mark the ends of the rod.


Example 10 


A straight rod lies along the x-axis between x = 2 and x = 4 (measured in
metres). The density of the rod per unit length depends on x, and is given
by the function $f(x) = 3x^2 - x$ (measured in kilograms per metre). What
is the total mass of the rod?

Solution

The total mass of the rod is

\[ M = \int_2^4 f(x)\, dx = \int_2^4 (3x^2 - x)\, dx = \left[x^3 - \tfrac12 x^2\right]_2^4 = 56 - 6 = 50. \]

So the rod has mass 50 kilograms.
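The description of a definite integral as an accumulation of small quantities can be tried out directly on this rod. The sketch below, with illustrative names `density` and `rod_mass`, splits the rod into equal segments, approximates the mass of each as density at the centre times length, and adds them up; sampling at the centre of each segment is one choice among several.

```python
def density(x):
    """Density per unit length in kg/m, as given in Example 10."""
    return 3 * x**2 - x


def rod_mass(n_segments, a=2.0, b=4.0):
    """Add up density(x) * dx over n equal segments, sampling each at its centre."""
    dx = (b - a) / n_segments
    return sum(density(a + (i + 0.5) * dx) * dx for i in range(n_segments))


# The estimates approach the exact mass of 50 kg as the segments shrink.
for n in (1, 10, 100, 1000):
    print(f"{n:5d} segments: mass ~ {rod_mass(n):.6f} kg")
```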


Exercise 43 


Calculate each of the following definite integrals. 


1 

(a) / (a? — 2) dx 
2 

(b) i (3 — 2) de 


2 
(c) / (03 — 2) dex 


How is this integral related to those in parts (a) and (b)? 


Exercise 44 


Evaluate $\int_0^{3/2} \frac{1}{9 + 4x^2}\, dx$.


Definite integrals can be evaluated by finding the corresponding indefinite 
integral, and then using the limits of integration. When using integration 
by substitution, there is an extra step involving conversion of the limits. 


Example 11 
Suppose that we wish to evaluate the integral

\[ I = \int_{1/2}^{3/2} \frac{1}{\sqrt{4 - (2x - 1)^2}}\, dx. \]

The closest integral to this one in the table of standard integrals in the
Handbook is

\[ \int \frac{1}{\sqrt{a^2 - x^2}}\, dx = \arcsin\left(\frac{x}{a}\right). \]

To convert I into this form, we change the variable of integration to
$u = 2x - 1$. Then $du/dx = 2$. This allows us to write
$du = (du/dx)\, dx = 2\, dx$, so we can make the replacement $dx = \tfrac12\, du$.
Consequently, our integral becomes

\[ I = \frac12 \int_{x=1/2}^{x=3/2} \frac{1}{\sqrt{4 - u^2}}\, du. \tag{86} \]

Here the limits are written as x = 1/2 and x = 3/2 to explicitly show that
they refer to the variable x rather than u. We can easily convert to the
appropriate limits for u, using the relationship between u and x. If
x = 1/2, then u = 0, and if x = 3/2, then u = 2. Hence, using the integral
from the Handbook, the integral I becomes

\[ I = \frac12 \int_{u=0}^{u=2} \frac{1}{\sqrt{4 - u^2}}\, du = \frac12 \left[\arcsin\left(\frac{u}{2}\right)\right]_{u=0}^{u=2} = \frac12 \left(\frac{\pi}{2} - 0\right) = \frac{\pi}{4}. \]


Exercise 45 


Suppose that we are told that for any positive integer n,

\[ \int_0^{n\pi} u^2 \sin^2 u\, du = \frac{n^3 \pi^3}{6} - \frac{n\pi}{4}. \]

Use this integral to evaluate

\[ I = \int_0^1 x^2 \sin^2(3\pi x)\, dx. \]

Figure 30 Graph of sin(x) over the interval [−π, π]


Areas and the use of symmetry 


Another way of thinking about a definite integral is in terms of an area. If
$f(x) \geq 0$ for $a \leq x \leq b$, then the definite integral $\int_a^b f(x)\, dx$ is equal to the
area under the graph of f(x) between x = a and x = b (see Figure 29(a)).
There is one point to be careful about here. If $f(x) < 0$, corresponding to
a region below the x-axis, then we have a negative contribution to the
integral, whereas area is always a positive quantity. Thus for a function f
as pictured in Figure 29(b), $\int_a^b f(x)\, dx = \text{area } A_1 - \text{area } A_2$.

Figure 29 $\int_a^b f(x)\, dx$ as an area: (a) with $f(x) \geq 0$; (b) in general

It is possible for the positive contribution to a definite integral to exactly
cancel the negative contribution, giving a zero result. This leads to a
useful shortcut in some cases. Recall the following definitions:

• A function f(x) is said to be odd if $f(-x) = -f(x)$ for all x.
• A function f(x) is said to be even if $f(-x) = +f(x)$ for all x.


If a definite integral is over a range from —a to a, where a is a constant, it 
is worth checking whether the integrand is an odd function, as you will 
now see. 


Suppose that you are asked to calculate the following definite integrals:

\[ I_1 = \int_{-\pi}^{\pi} \sin(x)\, dx, \quad I_2 = \int_{-2}^{2} \sin(x^3)\, dx. \]

You could set about evaluating these, using the methods that you have
learned. And you might conclude that the second integral is a real
challenge, because you don’t know how to find an indefinite integral for
$\sin(x^3)$. But there is an alternative approach that is very valuable.
Figures 30 and 31 show plots of the integrands over their ranges of
integration.

From Figure 30 we see that sin(x) is an odd function (that is,
$\sin(-x) = -\sin(x)$). Because the limits of integration are equally spaced
either side of the origin, the contribution to the definite integral coming
from any positive value of x is exactly cancelled by a contribution
from −x. By symmetry, we see that the integral must be equal to zero:
$I_1 = 0$. This argument can be generalised as follows.




Definite integral of an odd function over a symmetric range 


The integral of an odd function f(x) over a range $-a \leq x \leq a$ vanishes:

\[ \int_{-a}^{a} f(x)\, dx = 0. \]

Similarly, Figure 31 shows that $\sin(x^3)$ is an odd function. This can be
confirmed by noticing that if $f(x) = \sin(x^3)$, then

\[ f(-x) = \sin((-x)^3) = \sin(-x^3) = -\sin(x^3) = -f(x). \]

In $I_2$, $\sin(x^3)$ is integrated over a range centred on the origin, so we can
immediately say that $I_2 = 0$, in spite of being unable to find the
corresponding indefinite integral!
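The symmetry argument can be checked numerically. The following sketch is an illustration under stated assumptions: it takes the second integrand to be sin(x³), uses a simple midpoint rule (the name `midpoint_rule` is invented here) on sample points placed symmetrically about the origin, and so reproduces the pairwise cancellation described above up to rounding error.

```python
import math


def midpoint_rule(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] using n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


# Odd integrands over ranges centred on the origin: both results should be close to zero.
I1 = midpoint_rule(math.sin, -math.pi, math.pi)
I2 = midpoint_rule(lambda x: math.sin(x**3), -2.0, 2.0)

# For contrast, half of the second range gives a non-zero value.
half = midpoint_rule(lambda x: math.sin(x**3), 0.0, 2.0)

print(f"I1 ~ {I1:.3e}")
print(f"I2 ~ {I2:.3e}")
print(f"integral of sin(x^3) over [0, 2] ~ {half:.6f}")
```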


Figure 31 Graph of $\sin(x^3)$ over the interval [−2, 2]


Evaluating definite integrals can be tedious and a source of errors when 
performing calculations by hand. Looking out for opportunities to use 
symmetries to show that integrals are equal to zero, or else equal to others 
that you have already calculated, is a valuable skill that will save you time. 


Exercise 46 


Use symmetry to evaluate (or help to evaluate) the following integrals. 


7 1 
(a) ; e* sina dz (b) | asin(1 + a*) dx 
- = 


(c) ie (a? + 2°) cos(z°) dx 


Figure 32 (a) A graph of intensity I against wavelength λ for the
visible spectral lines emitted by oxygen atoms. These spectral lines
are limited in number because certain integrals vanish for reasons of
symmetry. The green line is by far the most intense, and this is
responsible for the magnificent green glow of (b) the Aurora Borealis.


Learning outcomes 


After studying this unit, you should be able to do the following. 


Understand the following terms: variable, dependent variable, 
independent variable, parameter; domain and image set of a function; 
inverse function, linear function, quadratic function. 


Solve two simultaneous linear equations by Gaussian elimination. 
Solve quadratic equations, using a formula or by factorisation. 
Compose functions and recognise compositions of functions. 


Recognise, sketch graphs of and manipulate the power, exponential 
and (natural) logarithm functions. 


Recognise, sketch graphs of and manipulate trigonometric functions 
and their inverses. 


Be aware of the series expansions of sin, cos and exp. 


Add, subtract, multiply and divide complex numbers, and move 
between Cartesian, polar and exponential forms of a complex number. 
Express sin and cos in terms of complex exponentials. 


Differentiate functions, using a table of standard derivatives and the 
rules for differentiating products, quotients and compositions of 
functions. 


Identify stationary points, local maxima and minima, and asymptotes; 
use such information to sketch graphs of functions. 


Find indefinite integrals, using a table of standard integrals and (in 
simple cases) rules for integration by substitution and by parts. 


Find definite integrals, using substitution and symmetry arguments 
where appropriate. 


Solutions to exercises 


Solution to Exercise 1 


(a) The domain of f(x) is given in the question as $x \geq 0$. Since f(0) = 6,
and f(x) increases indefinitely as x increases, the image set of
y = f(x) is $y \geq 6$. This can also be written as $6 \leq y < \infty$.

(b) Setting $y = 2x^2 + 6$ and solving for x gives $x = \pm\sqrt{\tfrac12 (y - 6)}$.
However, we are told that $x \geq 0$, so we can ignore the negative
solution. The inverse function is therefore $x = g(y) = \sqrt{\tfrac12 (y - 6)}$.
The domain of the inverse function g(y) is equal to the image set of
the original function, and so is $y \geq 6$. The image set of the inverse
function x = g(y) is equal to the domain of the original function, and
so is $x \geq 0$.


Solution to Exercise 2 
(a) For Y = 2000 at t = —3600, we have 
2000 = 5(—3600) + c = —18000 + c. 
Hence c = 2000 + 18000 = 20000. 


(b) (i) The coastguard vessel catches the smuggler’s boat when X = Y, 
i.e. when 


7t = 5t + 20000.
This gives 2t = 20000, so t = 10000. 


10000 seconds is 2 hours, 46 minutes and 40 seconds. So the 
smuggler’s boat is caught at about 2.47 am. 


(ii) At t= 10000, both X and Y are equal to 70000. So the 
smuggler’s boat is caught 70km from A, which is inside 
territorial waters. 


Solution to Exercise 3 


Multiplying the first equation by $\tfrac32$ gives

\[ 3u - \tfrac{15}{2} v = \tfrac{57}{2}. \]

Subtracting this from the second equation gives

\[ \left(4 + \tfrac{15}{2}\right) v = -29 - \tfrac{57}{2}, \]

i.e. $\tfrac{23}{2} v = -\tfrac{115}{2}$, so $v = -\tfrac{115}{23} = -5$.

Substituting this into the first equation gives

\[ 2u - 5(-5) = 19, \]

so $u = (19 - 25)/2 = -3$.

Thus the solution is u = −3, v = −5.




(It is good practice to check solutions where you can, and this is easily 
done here. With u = —3 and v = —5, we have 


2u — 5v = 2(—3) — 5(—5) = -6 + 25 = 19, 
3u + 4v = 3(—3) + 4(—5) = —9 — 20 = —29, 


so these values of u and v do satisfy the given equations.) 
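The elimination steps can be sketched in a few lines of code. This is an illustrative sketch rather than a general solver: the function name `solve_2x2` is invented for the example, exact fractions are used so that the intermediate values can be compared with a hand calculation, and it is assumed that the first coefficient is non-zero and that the solution is unique.

```python
from fractions import Fraction


def solve_2x2(a1, b1, c1, a2, b2, c2):
    """Solve a1*u + b1*v = c1 and a2*u + b2*v = c2 by elimination.

    Assumes a1 != 0 and that the system has a unique solution.
    """
    a1, b1, c1, a2, b2, c2 = map(Fraction, (a1, b1, c1, a2, b2, c2))
    m = a2 / a1                # multiplier that matches the u-coefficients
    b2_new = b2 - m * b1       # subtract m times the first equation from the second
    c2_new = c2 - m * c1
    v = c2_new / b2_new        # the second equation now involves v only
    u = (c1 - b1 * v) / a1     # back-substitute into the first equation
    return u, v


# The system 2u - 5v = 19, 3u + 4v = -29 checked at the end of this solution.
u, v = solve_2x2(2, -5, 19, 3, 4, -29)
print(u, v)  # prints -3 -5
```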


Solution to Exercise 4 


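(a) Using equation (9), we obtain

\[ x = \frac{-7 \pm \sqrt{7^2 - 4 \times 2 \times (-4)}}{2 \times 2} = \frac{-7 \pm \sqrt{49 + 32}}{4} = \frac{-7 \pm 9}{4} = \tfrac12 \text{ or } -4. \]

(These solutions can be checked by substitution into the quadratic
equation. For example, with x = −4,

\[ 2x^2 + 7x - 4 = 32 - 28 - 4 = 0, \]

as required.)

(b) Using equation (9), we obtain

\[ x = \frac{-1 \pm \sqrt{1^2 - 4 \times 1 \times (-6)}}{2 \times 1} = \frac{-1 \pm \sqrt{1 + 24}}{2} = \frac{-1 \pm 5}{2} = 2 \text{ or } -3. \]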
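The quadratic formula translates directly into code. The sketch below is illustrative: the function name `quadratic_roots` is an assumption, and `cmath.sqrt` is used so that a negative discriminant gives complex roots instead of an error.

```python
import cmath


def quadratic_roots(a, b, c):
    """Roots of a*x^2 + b*x + c = 0 (with a != 0) from the quadratic formula."""
    root_disc = cmath.sqrt(b * b - 4 * a * c)  # complex square root of the discriminant
    return (-b + root_disc) / (2 * a), (-b - root_disc) / (2 * a)


# The quadratic 2x^2 + 7x - 4 = 0 from part (a).
r1, r2 = quadratic_roots(2, 7, -4)
print(r1, r2)

# Substituting each root back into the quadratic should give (nearly) zero.
for r in (r1, r2):
    print(abs(2 * r**2 + 7 * r - 4))
```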


Solution to Exercise 5 


Using equation (9), the solutions are

\[ \frac{-2K \pm \sqrt{4K^2 - 4\omega^2}}{2} = \frac{-2K \pm 2\sqrt{K^2 - \omega^2}}{2} = -K \pm \sqrt{K^2 - \omega^2}, \]

as required.


Solution to Exercise 6 


(a) We could use equation (9); however, in these cases it is easier to 
factorise by hand. 


This is a difference of two squares:

\[ x^2 - a = (x - \sqrt{a})(x + \sqrt{a}). \]

(b) This can also be expressed in terms of a difference of two squares:

\[ 2x^2 - 8a = 2(x^2 - 4a) = 2(x - 2\sqrt{a})(x + 2\sqrt{a}). \]

(c) This is a perfect square. Let $y = x^2$. Then

\[ x^4 - 6x^2 + 9 = y^2 - 6y + 9 = (y - 3)^2 = (x^2 - 3)^2. \]


Solution to Exercise 7 
(a) oa = = gh. 

(b) w/a? =a? =a (or 1/0"): 

(c) (a a) = qeX _ qa. 

dia 2 = 0->™ oo ee ee 
( = Ae 1/%#8 = 

( 


ie = (5 - (i) = P= 8. 


@) (4)°? =(/8) =@°=8- 
(h) (162:4)1/2 _ 161/2(e*)\V2 = J16x4%1/2 = 4r?. 


e) 8 
f) 


Solution to Exercise 8 
(a) With $f(x) = x^2$ and $g(x) = 1/(x - 1)$ we have the following.
(i) $f(g(x)) = (1/(x - 1))^2 = 1/(x - 1)^2$.
(ii) $g(f(x)) = 1/(x^2 - 1)$.
Note that g(f(x)) is not the same as f(g(x)).
(b) We can obtain h(x) in three steps.
Step 1 Apply the square function to x.
Step 2 Add 1 to the result of Step 1.
Step 3 Apply the sine function to the result of Step 2.

Then h(x) = r(q(p(x))), where

\[ p(x) = x^2, \quad q(x) = x + 1, \quad r(x) = \sin(x). \]
Solution to Exercise 9 
For x = 1/2 the sum is 
1/2)? 1/2)3 1/2)4 1/2)? 

144442", G2), Gy", GRY ~ 1.649. 
For x = 1 the sum is 

iei¢ be pees 4 wore. 
For xz = 2 sum is 


LoS 7+ + + Bt Apt sty T7881 




These answers can be compared with those obtained by entering exp(1/2), 
exp(1) and exp(2) directly into a calculator: 


exp(1/2) = 1.648 721271..., 

exp(1) = 2.718 281 828..., 

exp(2) = 7.389056 099.... 
Our first two answers are correct to four significant figures. The last 
answer is correct to only two significant figures, but it could be made as 
accurate as we wish by adding more terms in the series. Note that as x 
increases, we need more terms in the sum to get an accurate answer for 
exp(z). 
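The effect of the number of terms can be explored with a short sketch. The function name `exp_partial_sum` and the particular term counts shown are assumptions made for illustration; they are not necessarily the number of terms used in the exercise.

```python
import math


def exp_partial_sum(x, n_terms):
    """Sum the first n_terms terms of the series 1 + x + x^2/2! + x^3/3! + ..."""
    total, term = 0.0, 1.0
    for n in range(n_terms):
        total += term
        term *= x / (n + 1)  # turns x^n/n! into x^(n+1)/(n+1)!
    return total


# For a fixed number of terms, the error grows as x increases.
for x in (0.5, 1.0, 2.0):
    for n_terms in (4, 8, 12):
        approx = exp_partial_sum(x, n_terms)
        print(f"x = {x}: {n_terms:2d} terms give {approx:.6f}, exp(x) = {math.exp(x):.6f}")
```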


Solution to Exercise 10 


Using a calculator we find 


exp(1/2) x exp(1/2) = 1.648 721271... x 1.648721 271... 
= 2.718 281 829..., 


which agrees with exp(1) = 2.718 281 828. (Any discrepancy in the last 
decimal place is insignificant because of rounding.) 


Similarly, 
exp(1) x exp(1) = 2.718 281 828... x 2.718281 828... 
= 7.389056 096..., 
which agrees with exp(2) = 7.389 056 099.... 


Solution to Exercise 11 

(a) $\ln 7 + \ln 4 - \ln 14 = \ln(7 \times 4/14) = \ln 2$.

(b) $\ln a + 2\ln b - \ln(a^2 b) = \ln a + \ln(b^2) - \ln(a^2 b)$
$= \ln(a \times b^2 \div (a^2 b))$
$= \ln(b/a)$ (or $\ln b - \ln a$).

(c) To simplify $\ln(e^x \times e^y)$, we first rearrange it in the form $\ln(e^{\text{something}})$,
which just equals something:

\[ \ln(e^x \times e^y) = \ln(e^{x+y}) = x + y. \]

An alternative argument gives

\[ \ln(e^x \times e^y) = \ln(e^x) + \ln(e^y) = x + y. \]

(d) In parts (d)–(f), we first rearrange the expression as $e^{\ln(\text{something})}$,
which also just equals something.
Here, $e^{2\ln x} = e^{\ln(x^2)} = x^2$.

(e) $e^{-2\ln x} = e^{\ln(x^{-2})} = x^{-2}$ (or $1/x^2$).

(f) $\exp(2\ln x + \ln(x + 1)) = \exp(\ln(x^2 \times (x + 1))) = x^2(x + 1)$.
An alternative argument gives

\[ \exp(2\ln x + \ln(x + 1)) = \exp(\ln(x^2)) \times \exp(\ln(x + 1)) = x^2(x + 1). \]




Solution to Exercise 12 
Using equation (30) we have

\[ A = e^{\pi \ln 5} = e^{5.0562} = 157.0 \]

and

\[ B = e^{-4.315 \ln 10} = e^{-9.9357} = 4.842 \times 10^{-5} \]

to four significant figures.


Solution to Exercise 13 


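(a) Note that the question asks us to use Figure 15, not read off values
from Figure 16! The figure given here shows a circle of radius 1. The
point A corresponds to a rotation through 0, and the point B
corresponds to a rotation through $\frac{\pi}{2}$.

We see from the figure that A has coordinates $(1, 0) = (\cos 0, \sin 0)$,
and B has coordinates $(0, 1) = (\cos\frac{\pi}{2}, \sin\frac{\pi}{2})$. Therefore

\[ \sin 0 = 0, \quad \cos 0 = 1, \quad \sin\tfrac{\pi}{2} = 1, \quad \cos\tfrac{\pi}{2} = 0. \]

(b) Since $\sin 0 = 0$, $\operatorname{cosec} 0$ and $\cot 0$ are not defined. We have
$\tan 0 = 0$ and $\sec 0 = 1$.
Since $\cos\frac{\pi}{2} = 0$, $\tan\frac{\pi}{2}$ and $\sec\frac{\pi}{2}$ are not defined. We have
$\cot\frac{\pi}{2} = 0$ and $\operatorname{cosec}\frac{\pi}{2} = 1$.

(c) Using the triangles in Figure 17, we obtain

\[ \sin\tfrac{\pi}{4} = \tfrac{1}{\sqrt2}, \quad \sin\tfrac{\pi}{3} = \tfrac{\sqrt3}{2}, \quad \sin\tfrac{\pi}{6} = \tfrac12, \]
\[ \cos\tfrac{\pi}{4} = \tfrac{1}{\sqrt2}, \quad \cos\tfrac{\pi}{3} = \tfrac12, \quad \cos\tfrac{\pi}{6} = \tfrac{\sqrt3}{2}, \]
\[ \tan\tfrac{\pi}{4} = 1, \quad \tan\tfrac{\pi}{3} = \sqrt3, \quad \tan\tfrac{\pi}{6} = \tfrac{1}{\sqrt3}, \]
\[ \cot\tfrac{\pi}{4} = 1, \quad \cot\tfrac{\pi}{3} = \tfrac{1}{\sqrt3}, \quad \cot\tfrac{\pi}{6} = \sqrt3. \]

(d) From Figure 16, $\sin\theta = 0$ for $\theta = 0$ and $\theta = \pi$, and for any value of $\theta$
that differs from 0 or $\pi$ by any integer multiple of $2\pi$. Hence we
conclude that $\sin\theta = 0$ if $\theta = n\pi$, where $n = 0, \pm 1, \pm 2, \ldots$ (i.e. n is any
integer). This can be written as $n \in \mathbb{Z}$, where $\mathbb{Z}$ is the set of all
integers.

The symbol $\in$ means ‘is a member of’.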

Solution to Exercise 14 


(a) Using a calculator, one solution is arcsin 0.8 = 0.93 (to two decimal
places). Looking at the graph of $y = \sin\theta$ shown below, we see that it
is symmetric about $\theta = \frac{\pi}{2}$. So there is also a solution of $\sin\theta = 0.8$ at
$\theta = \pi - 0.93 = 2.21$ (to two decimal places). These are the only
solutions with $\theta$ between 0 and $2\pi$. (If $\theta$ is between $\pi$ and $2\pi$, then
$\sin\theta$ is negative.)

The other solutions are obtained by adding multiples of $2\pi$ to these
two. The solutions in the required range are (to two decimal places)

0.93, 2.21, 7.21, 8.50, 13.49, 14.78.




(b) We saw in Exercise 13(c) that $\tan\frac{\pi}{4} = 1$, so $\theta = \frac{\pi}{4}$ is one solution. We
can see from the graph of tan (Figure 18) that there is one solution of
this equation in the range $-\frac{\pi}{2}$ to $\frac{\pi}{2}$, and that other solutions are
obtained by adding multiples of $\pi$ to this. We can express the full set
of solutions as

\[ \theta = \tfrac{\pi}{4} + n\pi, \]

where n is any integer (positive, zero or negative).
Solution to Exercise 15 
(a) Dividing equation (32) by $\cos^2\theta$ gives equation (35).
(b) Dividing equation (33) by equation (34) gives

\[ \tan(\theta \pm \phi) = \frac{\sin\theta\cos\phi \pm \cos\theta\sin\phi}{\cos\theta\cos\phi \mp \sin\theta\sin\phi}. \]

Then dividing each term in the numerator and denominator of the
right-hand side by $\cos\theta\cos\phi$ gives equation (37).

(c) Using equation (34) we obtain

\[ \cos(\theta - \phi) - \cos(\theta + \phi) = 2\sin\theta\sin\phi, \]

which establishes the result in equation (38).
(d) Setting $\phi = \theta$ in equation (33) and taking the upper signs gives
equation (41).
Solution to Exercise 16 
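(a) $\sin(2\pi - \theta) = \sin 2\pi \cos\theta - \cos 2\pi \sin\theta = 0 \times \cos\theta - 1 \times \sin\theta = -\sin\theta$.

(b) $\sin(\frac{\pi}{2} - \theta) = \sin\frac{\pi}{2} \cos\theta - \cos\frac{\pi}{2} \sin\theta = 1 \times \cos\theta - 0 \times \sin\theta = \cos\theta$.

For $0 < \theta < \frac{\pi}{2}$, this result can be confirmed by examination of a
right-angled triangle.

(c) $\sin(\pi - \theta) = \sin\pi \cos\theta - \cos\pi \sin\theta = 0 \times \cos\theta - (-1) \times \sin\theta = \sin\theta$.

(d) $\cos(\pi - \theta) = \cos\pi \cos\theta + \sin\pi \sin\theta = (-1) \times \cos\theta + 0 \times \sin\theta = -\cos\theta$.

(e) $\cos(2\pi - \theta) = \cos 2\pi \cos\theta + \sin 2\pi \sin\theta = 1 \times \cos\theta + 0 \times \sin\theta = \cos\theta$.

(f) $\cos(\frac{\pi}{2} - \theta) = \cos\frac{\pi}{2} \cos\theta + \sin\frac{\pi}{2} \sin\theta = 0 \times \cos\theta + 1 \times \sin\theta = \sin\theta$.

For $0 < \theta < \frac{\pi}{2}$, this result can be confirmed by examination of a
right-angled triangle.

(g) $\cos(\frac{3\pi}{2} + x) = \cos\frac{3\pi}{2} \cos x - \sin\frac{3\pi}{2} \sin x = 0 \times \cos x - (-1) \times \sin x = \sin x$.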


Solution to Exercise 17 

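(a) $\overline{v} = 3 + 4i$.

(b) $|v| = \sqrt{3^2 + 4^2} = 5$.

(c) $v - w = (3 - 4i) - (2 - i) = 1 - 3i$.

(d) $vw = (3 - 4i)(2 - i) = 6 - 8i - 3i + 4i^2 = 2 - 11i$.

(e) $\dfrac{w}{v} = \dfrac{w\overline{v}}{v\overline{v}} = \dfrac{(2 - i)(3 + 4i)}{25} = \dfrac{6 - 3i + 8i - 4i^2}{25} = \dfrac{10}{25} + \dfrac{5}{25} i = \dfrac25 + \dfrac15 i$.

(f) $\dfrac{1}{w} = \dfrac{\overline{w}}{w\overline{w}} = \dfrac{2 + i}{5} = \dfrac25 + \dfrac15 i$.

(g) $w^2 = (2 - i)(2 - i) = 4 - 2i - 2i + i^2 = 3 - 4i$.

(h) $2w - 3v = 4 - 2i - (9 - 12i) = -5 + 10i$.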


Solution to Exercise 18 


We obtain the solutions

\[ \frac{-2 \pm \sqrt{4 - 8}}{4} = \frac{-2 \pm 2i}{4} = -\tfrac12 \pm \tfrac12 i. \]




Solution to Exercise 19 


The points are illustrated in the figure below.

[Figure: the points (−2, 0), (1, 1), (−1, −1), (4, 0), (0, 4) and (−√3, 1) plotted in the (x, y)-plane, with the angle φ marked for the point (−√3, 1).]

Using trigonometry in this diagram, the polar coordinates of these points
are as follows:

$(x, y) = (-2, 0)$ corresponds to $(r, \theta) = (2, \pi)$,
$(x, y) = (1, 1)$ corresponds to $(r, \theta) = (\sqrt2, \frac{\pi}{4})$,
$(x, y) = (-1, -1)$ corresponds to $(r, \theta) = (\sqrt2, -\frac{3\pi}{4})$,
$(x, y) = (4, 0)$ corresponds to $(r, \theta) = (4, 0)$,
$(x, y) = (0, 4)$ corresponds to $(r, \theta) = (4, \frac{\pi}{2})$,
$(x, y) = (-\sqrt3, 1)$ corresponds to $(r, \theta) = (2, \pi - \phi)$,

where $\phi$ is as shown in the figure.

Now $\tan\phi = \frac{1}{\sqrt3}$, so $\phi = \frac{\pi}{6}$ (see the solution to Exercise 13(c)). So
$(x, y) = (-\sqrt3, 1)$ has polar coordinates $(r, \theta) = (2, \frac{5\pi}{6})$.
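These conversions can be reproduced with the two-argument arctangent, which places the angle in the correct quadrant automatically. The sketch below is illustrative: the function name `to_polar` is an assumption, and angles are printed as multiples of π for comparison with the values above.

```python
import math


def to_polar(x, y):
    """Return (r, theta), with theta the principal value in the range -pi < theta <= pi."""
    return math.hypot(x, y), math.atan2(y, x)


points = [(-2, 0), (1, 1), (-1, -1), (4, 0), (0, 4), (-math.sqrt(3), 1)]
for x, y in points:
    r, theta = to_polar(x, y)
    print(f"({x:.3f}, {y:.3f}) -> r = {r:.4f}, theta = {theta / math.pi:.4f} * pi")
```

Using `math.atan2(y, x)` rather than `math.atan(y / x)` avoids both division by zero at x = 0 and the ambiguity between opposite quadrants.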


Solution to Exercise 20 
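\[ z = 2\left(\cos\left(-\tfrac{\pi}{4}\right) + i\sin\left(-\tfrac{\pi}{4}\right)\right) = 2\left(\tfrac{1}{\sqrt2} - \tfrac{1}{\sqrt2} i\right) = \sqrt2 (1 - i). \]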


Solution to Exercise 21 


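(a) For all these numbers, we can use the diagram accompanying the
solution to Exercise 19.

$-2$ has polar coordinates $(r, \theta) = (2, \pi)$. (Note that r must be positive,
and the principal value of the argument must be in the range
$-\pi < \theta \leq \pi$.)

Writing the polar form in full, we get $-2 = 2(\cos\pi + i\sin\pi)$.

(b) $1 + i$ has polar coordinates $(r, \theta) = (\sqrt{2}, \tfrac{\pi}{4})$.
So $1 + i = \sqrt{2}\left(\cos(\tfrac{\pi}{4}) + i\sin(\tfrac{\pi}{4})\right)$.

(c) $-1 - i$ has polar coordinates $(r, \theta) = (\sqrt{2}, -\tfrac{3\pi}{4})$.
So $-1 - i = \sqrt{2}\left(\cos(-\tfrac{3\pi}{4}) + i\sin(-\tfrac{3\pi}{4})\right)$.

(d) $4$ has polar coordinates $(r, \theta) = (4, 0)$.
So $4 = 4(\cos(0) + i\sin(0))$.

(e) $4i$ has polar coordinates $(r, \theta) = (4, \tfrac{\pi}{2})$.
So $4i = 4\left(\cos(\tfrac{\pi}{2}) + i\sin(\tfrac{\pi}{2})\right)$.

(f) $-\sqrt{3} + i$ has polar coordinates $(r, \theta) = (\sqrt{3 + 1}, \pi - \tfrac{\pi}{6}) = (2, \tfrac{5\pi}{6})$.
So $-\sqrt{3} + i = 2\left(\cos(\tfrac{5\pi}{6}) + i\sin(\tfrac{5\pi}{6})\right)$.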
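
The polar coordinates quoted in this solution can be cross-checked numerically. The following is a minimal sketch, not part of the module materials: it assumes Python's standard `cmath.polar` function, and the helper name `principal_polar` and the `expected` table are introduced here purely for illustration.

```python
import cmath
import math


def principal_polar(z):
    """Return (r, theta) for a complex number z, as given by cmath.polar."""
    r, theta = cmath.polar(z)
    return r, theta


# The six numbers from the solution, paired with the (r, theta) quoted there.
expected = {
    complex(-2, 0): (2.0, math.pi),
    complex(1, 1): (math.sqrt(2), math.pi / 4),
    complex(-1, -1): (math.sqrt(2), -3 * math.pi / 4),
    complex(4, 0): (4.0, 0.0),
    complex(0, 4): (4.0, math.pi / 2),
    complex(-math.sqrt(3), 1): (2.0, 5 * math.pi / 6),
}

for z, (r_expected, theta_expected) in expected.items():
    r, theta = principal_polar(z)
    # Print the computed and quoted values side by side for comparison.
    print(z, (round(r, 6), round(theta, 6)),
          (round(r_expected, 6), round(theta_expected, 6)))
```

Each printed pair should agree to rounding, which is a quick way to catch a sign slip in the argument.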

Solution to Exercise 22 


(a) The modulus of z is 3, and the argument is $-7\pi/2$. We need to find
the principal value of the argument, which means that we need to add
some multiple of $2\pi$ to it so that it lies in the range $-\pi < \theta \leq \pi$.
Clearly $-7\pi/2 + 4\pi = \pi/2$ lies in this range, so

\[ z = 3e^{-7i\pi/2} = 3e^{i\pi/2}. \]

(b) The argument is $71\pi$, which is equivalent to $71\pi - 70\pi = \pi$. So

\[ z = e^{71i\pi} = e^{i\pi}. \]


Solution to Exercise 23 


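The exponential form of z is $z = re^{i\theta}$, so

\[ ze^{i\omega t} = re^{i\theta}e^{i\omega t} = re^{i(\omega t + \theta)}. \]

Taking the real part of this expression, we obtain

\[ \mathrm{Re}(ze^{i\omega t}) = \mathrm{Re}(re^{i(\omega t + \theta)}) = r\cos(\omega t + \theta). \]
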
Solution to Exercise 24 


The polar coordinates of $z = 1 - i$ are $(r, \theta) = (\sqrt{2}, -\tfrac{\pi}{4})$, so the
exponential form of z is

\[ z = \sqrt{2}\,e^{-i\pi/4}. \]

Hence

\[ z^{20} = \left(\sqrt{2}\,e^{-i\pi/4}\right)^{20} = (\sqrt{2})^{20}e^{-i20\pi/4} = 2^{10}e^{-5i\pi} = 1024\,e^{i\pi}, \]

where we have added $6\pi$ to the argument of the exponential to obtain the
principal value in the last line. Returning to Cartesian form by using
Euler's formula, we obtain

\[ (1 - i)^{20} = 1024(\cos\pi + i\sin\pi) = -1024. \]
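
As a cross-check on this calculation, the power can be computed both directly and via the modulus and argument. This is only an illustrative sketch using Python's standard `cmath` module; the variable names are introduced here and are not part of the module text.

```python
import cmath

z = 1 - 1j

# Polar route: raise the modulus to the 20th power and multiply the argument by 20.
r, theta = cmath.polar(z)
r20 = r ** 20
via_polar = cmath.rect(r20, 20 * theta)

# Direct route: let Python carry out the complex power.
direct = z ** 20

print("modulus of z^20:", r20)
print("via polar form :", via_polar)
print("direct power   :", direct)
```

Both routes should give a value very close to -1024, with any small imaginary part in the polar route being floating-point rounding.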




Solution to Exercise 25 


Using the Handbook result

\[ \frac{d}{dx}(\cos(ax)) = -a\sin(ax) \]

with a = 3 and x replaced by t, we get

\[ \frac{dx}{dt} = -21\sin(3t). \]

To differentiate again, we use

\[ \frac{d}{dx}(\sin(ax)) = a\cos(ax) \]

to obtain

\[ \frac{d^2x}{dt^2} = -63\cos(3t). \]


Solution to Exercise 26 
We need the Handbook result

\[ \frac{d}{dx}(e^{ax}) = ae^{ax}. \]

The rate at which the wage bill will be rising is

\[ \frac{dB}{dt} = 10^5(0.04)\exp(0.04t) = 4000\exp(0.04t). \]

As a fraction of the future wage bill B, the rate of rise dB/dt is

\[ \frac{1}{B}\frac{dB}{dt} = \frac{10^5(0.04)\exp(0.04t)}{10^5\exp(0.04t)} = 0.04. \]

So the rate of rise is 4% per year.


Solution to Exercise 27 


(a) a = (—9)(—5) exp(—5az) = 45 exp(—5z). 
(b) $F'(x) = 12x^3 - 4$, so putting x = 2 gives
$F'(2) = 12 \times 2^3 - 4 = 92$.
BD on Os” 
t dt? aa 
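(d) $g'(t) = -3a\sin(3t) + 3b\cos(3t)$, and then
$g''(t) = -9a\cos(3t) - 9b\sin(3t)$,
so
$g''(0) = -9a$.
(e) $F'(x) = 6\sec(2x)\tan(2x) - 12\sin(-3x)$. Hence
\[ F'(\tfrac{\pi}{6}) = 6\sec\tfrac{\pi}{3}\tan\tfrac{\pi}{3} - 12\sin(-\tfrac{\pi}{2}) = 6 \times 2 \times \sqrt{3} - 12 \times (-1) = 12\sqrt{3} + 12. \]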




Solution to Exercise 28 


Given $f(t) = \cos(2t) + i\sin(2t)$, we have $f'(t) = -2\sin(2t) + 2i\cos(2t)$
and then

\[ f''(t) = -4\cos(2t) - 4i\sin(2t). \]

Writing $f(t) = \exp(2it)$, we have $f'(t) = 2i\exp(2it)$ and

\[ f''(t) = (2i)^2\exp(2it) = -4\exp(2it) = -4\cos(2t) - 4i\sin(2t), \]


as before. 


Solution to Exercise 29 


(a) The function $y = (\ln x)/(x^2 + 1)$ is a quotient. Using standard
derivatives from the Handbook, we obtain

\[ \frac{dy}{dx} = \frac{\left(\frac{1}{x}\right)(x^2 + 1) - (\ln x)(2x)}{(x^2 + 1)^2} = \frac{x + x^{-1} - 2x\ln x}{(x^2 + 1)^2}. \]

This can be simplified by multiplying the numerator and denominator
by x:

\[ \frac{dy}{dx} = \frac{x^2 + 1 - 2x^2\ln x}{x(x^2 + 1)^2}. \]

(b) The function $f(t) = t^5\ln(3t)$ is a product. Recalling that

\[ \frac{d}{dx}(\ln(ax)) = 1/x \quad \text{for } ax > 0, \]

we get

\[ f'(t) = 5t^4\ln(3t) + t^5 \times \frac{1}{t} = 5t^4\ln(3t) + t^4. \]

(c) The function $g(t) = (At + B)\sin(Ct)$ is a product. We get

\[ g'(t) = A\sin(Ct) + (At + B)C\cos(Ct), \]

so $g'(0) = BC$.

(d) Given $x(t) = e^{-3t}\sin(4t)$, we want to find $\dot{x}(t)$ and $\ddot{x}(t)$. Using the
product rule:

\[ \dot{x}(t) = -3e^{-3t}\sin(4t) + e^{-3t}(4\cos(4t)) = e^{-3t}(4\cos(4t) - 3\sin(4t)). \]

Using the product rule again:

\[ \ddot{x}(t) = -3e^{-3t}(4\cos(4t) - 3\sin(4t)) + e^{-3t}(-16\sin(4t) - 12\cos(4t)) = -e^{-3t}(7\sin(4t) + 24\cos(4t)). \]




Solution to Exercise 30 


(a) In each case, we indicate the split into inner and outer functions but 
(except in this part) omit details. It is perfectly acceptable to do the 
differentiation in your head, using the product of the derivatives of 
outer and inner functions. 


Taking the inner function to be $u = t^2$, the outer function is
$y(u) = \exp(u)$. We have $dy/du = \exp(u)$ and $du/dt = 2t$, so

\[ \frac{dy}{dt} = \frac{dy}{du}\frac{du}{dt} = \exp(u) \times 2t = 2t\exp(t^2). \]

(b) Taking the inner function to be $u = 3x^3 + 4$, the outer function is
$f(u) = u^6$, so

\[ f'(x) = 6(3x^3 + 4)^5 \times (9x^2) = 54x^2(3x^3 + 4)^5. \]

(c) Taking the inner function to be $u = 3v + 4$, the outer function is
$z(u) = \tan u$, so

\[ \frac{dz}{dv} = \sec^2(3v + 4) \times 3 = 3\sec^2(3v + 4). \]

(d) Taking the inner function to be $u = 4 - x^2$, the outer function is
$g(u) = u^{1/2}$, so

\[ g'(x) = \tfrac{1}{2}(4 - x^2)^{-1/2} \times (-2x) = -\frac{x}{\sqrt{4 - x^2}}. \]


(e) Taking the inner function to be u = 1 + 2x7, the outer function is 
f(u) = u-/? (see Example 2), so 
—6x 


Fa) = 3 (1428) 0" x (da) = ee 


Solution to Exercise 31 

(a) $\sec(x/(x^2 + 1))$ is a composite function, with

\[ y = \sec u, \quad u = \frac{x}{x^2 + 1}. \]

Here u is a quotient, and

\[ \frac{du}{dx} = \frac{1(x^2 + 1) - x(2x)}{(x^2 + 1)^2} = \frac{1 - x^2}{(x^2 + 1)^2}. \]

Then, using the chain rule,

\[ \frac{dy}{dx} = \frac{dy}{du}\frac{du}{dx} = \sec u\tan u \times \frac{1 - x^2}{(x^2 + 1)^2} = \frac{1 - x^2}{(x^2 + 1)^2}\sec\left(\frac{x}{x^2 + 1}\right)\tan\left(\frac{x}{x^2 + 1}\right). \]

(b) $t^2\exp(t^3 + 1)$ is a product of $u = t^2$ and $v = \exp(t^3 + 1)$. The second
part of the product is a composite function with inner function
$g = t^3 + 1$ and outer function $f = \exp g$. The chain rule gives

\[ \frac{dv}{dt} = \exp(t^3 + 1) \times 3t^2 = 3t^2\exp(t^3 + 1). \]

Then, using the product rule,

\[ \frac{d}{dt}\left(t^2\exp(t^3 + 1)\right) = 2t\exp(t^3 + 1) + t^2\left(3t^2\exp(t^3 + 1)\right) = (3t^4 + 2t)\exp(t^3 + 1). \]


Solution to Exercise 32 
(a) (i) The product rule gives

\[ \frac{d}{dx}(x^2y) = 2xy + x^2\frac{dy}{dx}. \]

(ii) The composite rule gives

\[ \frac{d}{dx}(y^3) = 3y^2\frac{dy}{dx}. \]

(iii) The composite rule gives

\[ \frac{d}{dx}(x + \sin(xy)) = 1 + \cos(xy)\frac{d(xy)}{dx} = 1 + \cos(xy)\left(y + x\frac{dy}{dx}\right). \]

(b) Using implicit differentiation, we obtain

\[ 3x^2 + 2xy + x^2\frac{dy}{dx} + 3y^2\frac{dy}{dx} = 0. \]

When x = -1 and y = 1, this gives

\[ 3 - 2 + \frac{dy}{dx} + 3\frac{dy}{dx} = 0. \]

Hence $dy/dx = -\tfrac{1}{4}$, and the required gradient is $-\tfrac{1}{4}$.

Solution to Exercise 33 
(a) To test for stationary points, use the product rule to find

\[ y'(x) = -2e^{-x/2} - (2(x + 1)) \times (-\tfrac{1}{2}e^{-x/2}) = (-2 + (x + 1))e^{-x/2} = (x - 1)e^{-x/2}. \]

So $y'(x) = 0$ only at x = 1. This is the only stationary point.

(b) To classify the stationary point, we differentiate again:

\[ y''(x) = e^{-x/2} + (x - 1)(-\tfrac{1}{2})e^{-x/2} = \tfrac{1}{2}(3 - x)e^{-x/2}. \]

At the stationary point, $y''(1) = e^{-1/2} > 0$, so the stationary point is a
local minimum.

Alternatively, we see that the first derivative $y'$ is negative if x < 1
and positive if x > 1, so the first derivative test confirms that it is a
local minimum.

The value of the function at the stationary point is
$y(1) = 5 - 4e^{-1/2} = 2.574$ (to four significant figures).




Solution to Exercise 34 


(a) (i) The denominator $4 + 1.5v + 0.008v^2$ is positive for all $v \geq 0$, so
f(v) is defined for all $v \geq 0$.
(ii) f(v) = 0 only when v = 0.
(iii) As $v \to \infty$, $f(v) \to 0$.
(iv) To find any stationary points, differentiate f(v) using the
quotient rule, to obtain

\[ f'(v) = \frac{(4 + 1.5v + 0.008v^2) - v(1.5 + 0.016v)}{(4 + 1.5v + 0.008v^2)^2} = \frac{4 - 0.008v^2}{(4 + 1.5v + 0.008v^2)^2}. \]

The stationary points occur when $4 - 0.008v^2 = 0$, i.e. at
$v = \pm\sqrt{500} = \pm 22.36$ (to two decimal places). The negative
stationary point is outside the domain ($v \geq 0$), so we need
consider only v = 22.36.

For v < 22.36, $f'(v) > 0$, while for v > 22.36, $f'(v) < 0$.
Therefore v = 22.36 is a local maximum. At this point,
f(22.36) = 0.538 (to three decimal places).

(b) A graph of f is shown below.

[Figure: graph of f(v) against v, rising from 0 at v = 0 to a maximum value of 0.538 at v = 22.36 and then decreasing towards 0.]

From the graph we see that the global maximum of f occurs at the
local maximum, i.e. at $v = \sqrt{500} = 22.36$. The global minimum occurs
at the endpoint of the domain, i.e. at v = 0.
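
The location and height of this maximum can be confirmed numerically. The sketch below is illustrative only: it assumes the function is $f(v) = v/(4 + 1.5v + 0.008v^2)$, as implied by the derivative in part (a), and the grid range 0 to 100 with spacing 0.01 is an arbitrary choice made here.

```python
import math


def f(v):
    """The function from the exercise, as inferred from the quotient-rule working."""
    return v / (4 + 1.5 * v + 0.008 * v ** 2)


# Analytic stationary point from 4 - 0.008 v^2 = 0.
v_star = math.sqrt(500)

# Brute-force search on a grid of v values from 0 to 100 (assumed range).
grid = [i * 0.01 for i in range(10001)]
v_best = max(grid, key=f)

print("analytic stationary point:", round(v_star, 2))
print("grid maximiser           :", round(v_best, 2))
print("maximum value f(v_star)  :", round(f(v_star), 3))
print("value at endpoint f(0)   :", f(0.0))
```

A grid search is crude, but it is independent of the differentiation and so provides a genuine check on the calculus.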

Solution to Exercise 35 

(a) Using the product rule for derivatives,

\[ \frac{d}{dx}\left(-\frac{x}{a}\cos(ax) + \frac{1}{a^2}\sin(ax) + C\right) = -\frac{1}{a}\cos(ax) + \left(-\frac{x}{a}\right)[-a\sin(ax)] + \frac{1}{a}\cos(ax) = x\sin(ax). \]

Therefore

\[ \int x\sin(ax)\,dx = -\frac{x}{a}\cos(ax) + \frac{1}{a^2}\sin(ax) + C, \]

so verifying the given integral.

(b) Using the composite rule for derivatives, and noting that $\cos(ax) > 0$
for $-\pi/2 < ax < \pi/2$, we have

\[ \frac{d}{dx}\left(-\frac{1}{a}\ln(\cos(ax)) + C\right) = \left(-\frac{1}{a}\right)\frac{1}{\cos(ax)}\frac{d}{dx}(\cos(ax)) = \frac{a\sin(ax)}{a\cos(ax)} = \tan(ax). \]

Therefore

\[ \int \tan(ax)\,dx = -\frac{1}{a}\ln(\cos(ax)) + C, \]

so verifying the given integral.


Solution to Exercise 36 


Using the given identity, we have

\[ \int \cos^2(ax)\,dx = \frac{1}{2}\int [1 + \cos(2ax)]\,dx = \frac{1}{2}\left(x + \frac{1}{2a}\sin(2ax)\right) + C = \frac{1}{2}x + \frac{1}{4a}\sin(2ax) + C. \]


Solution to Exercise 37 


(a) fe = te" er: 


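(b) \[ \int 6\sec^2(3t)\,dt = 6 \times \tfrac{1}{3}\tan(3t) + C = 2\tan(3t) + C. \]

(c) This does not quite match a Handbook integral as it stands. We have
$36 + 4x^2 = 4(9 + x^2)$, so

\[ \int \frac{1}{36 + 4x^2}\,dx = \frac{1}{4}\int \frac{1}{9 + x^2}\,dx = \frac{1}{4}\left(\frac{1}{3}\arctan\left(\frac{x}{3}\right)\right) + C = \frac{1}{12}\arctan\left(\frac{x}{3}\right) + C. \]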


(d) For y < 3, the integrand is positive, and we have 


1 4 


(e) For y > 3, the integrand is negative, and we have 


1 
[y= F in + 2) +0 = -Fmy 3) +0. 


Solution to Exercise 38 
(a) \[ \int (6\cos(-2t) + 8\sin(4t))\,dt = -3\sin(-2t) - 2\cos(4t) + C. \]

Note that you can use $\cos\theta = \cos(-\theta)$ to simplify the integrand (or
use $\sin\theta = -\sin(-\theta)$ to simplify the result) to obtain
$3\sin(2t) - 2\cos(4t) + C$.




(b) For $-3 < t < 3$,

\[ \int \frac{1}{\sqrt{9 - t^2}}\,dt = \arcsin\left(\tfrac{1}{3}t\right) + C. \]
(c) Fort <0; 
8 
[ae f (5847) a= $e + 7in-y +e. 
(d) For é> 0, 


(e) Choose a = 1, b= —1. Then b < <a and we can use the standard 
integral 


fey (SS) +C. 


Solution to Exercise 39 
Using v? — k? = (v—k)(v +k), and taking a = k and b = —k in the 


Handbook entry for / dx, with b < x < a, we obtain 


1 
(x — a)(a — b) 


[a-e®- | woaen® 
1 nf 


1 k- 
=x m(¢ z) +e (-k<u<k). 


Solution to Exercise 40 
(a) Ifu=2+ 4y?, then du/dy = 12y?. So 


[ Powe +4y°) dy = of Dau 
dx 
=.) edu 


(b) If u =siny, then du/dy = cosy. So 
[ cosusin® ydy = fe ahs, 
dy 


= [eau 


=ju°+C=ssin®?y+C. 


(c) If $u = 1 - t^2$, then $du/dt = -2t$. So

\[ \int t\sqrt{1 - t^2}\,dt = -\frac{1}{2}\int \sqrt{u}\,\frac{du}{dt}\,dt = -\frac{1}{2}\int u^{1/2}\,du = -\frac{1}{2}\left(\frac{2}{3}u^{3/2}\right) + C = -\frac{1}{3}(1 - t^2)^{3/2} + C. \]

(d) If $u = 1 + x^2$, then $du/dx = 2x$. So

\[ \int \frac{x}{1 + x^2}\,dx = \frac{1}{2}\int \frac{1}{u}\frac{du}{dx}\,dx = \frac{1}{2}\ln(u) + C = \frac{1}{2}\ln(1 + x^2) + C. \]

In this case, there is no problem with the domain of the ln function
because $u = 1 + x^2 > 0$. The answer can also be obtained directly
from equation (82) because the integrand is proportional to $g'(x)/g(x)$
with g(x) > 0.

(e) If $u = 1 + \sin^2 t$, then $du/dt = 2\sin t\cos t = \sin 2t$, using the
trigonometric identity for $\sin 2t$ (equation (41)). So the integrand is of
the form $g'(x)/g(x)$ with g(x) > 0. Hence equation (82) gives

\[ \int \frac{\sin 2t}{1 + \sin^2 t}\,dt = \ln|1 + \sin^2 t| + C = \ln(1 + \sin^2 t) + C. \]

(f) Using equation (82) with $g(y) = 1 - y^2$, so that $g'(y) = -2y$, we have
(for $y \neq \pm 1$)

\[ \int \frac{y}{1 - y^2}\,dy = -\frac{1}{2}\ln|1 - y^2| + C. \]


Solution to Exercise 41 


Taking u = 3x + 1, we have du/dx = 3, so we can take
du = (du/dx) dx = 3 dx, and dx = du/3. Also, x = (u - 1)/3, so

\[ 9x^2 + 1 = (u - 1)^2 + 1 = u^2 - 2u + 2. \]

Putting everything together gives

\[ \int \frac{9x^2 + 1}{3x + 1}\,dx = \int \frac{u^2 - 2u + 2}{u}\,\frac{du}{3} = \frac{1}{3}\int \left(u - 2 + \frac{2}{u}\right)du = \frac{1}{3}\left(\frac{1}{2}u^2 - 2u + 2\ln u\right) + C \]
\[ = \frac{1}{6}(3x + 1)^2 - \frac{2}{3}(3x + 1) + \frac{2}{3}\ln(3x + 1) + C. \]

There is no problem with the domain of the ln function in this case
because 3x + 1 > 0.




Solution to Exercise 42 
(a) Take $f(x) = x$ and $g'(x) = e^{-x}$. Then $f'(x) = 1$ and $g(x) = -e^{-x}$. So

\[ \int xe^{-x}\,dx = x(-e^{-x}) - \int 1 \times (-e^{-x})\,dx = -xe^{-x} + \int e^{-x}\,dx = -xe^{-x} - e^{-x} + C = -(x + 1)e^{-x} + C. \]

(b) Take $f(x) = x^2$ and $g'(x) = e^{-x}$. Then $f'(x) = 2x$ and $g(x) = -e^{-x}$.
So

\[ \int x^2e^{-x}\,dx = x^2(-e^{-x}) - \int 2x(-e^{-x})\,dx = -x^2e^{-x} + 2\int xe^{-x}\,dx. \]

Using the result of part (a), we get

\[ \int x^2e^{-x}\,dx = -x^2e^{-x} + 2\left(-(x + 1)e^{-x} + C\right) = -(x^2 + 2x + 2)e^{-x} + B, \]

where B = 2C is an arbitrary constant.

(c) Take $f(x) = \sin x$ and $g'(x) = e^x$ (the other way around also works).
Then $f'(x) = \cos x$ and $g(x) = e^x$. So

\[ \int e^x\sin x\,dx = e^x\sin x - \int e^x\cos x\,dx. \]

We now evaluate the integral on the right-hand side by parts, taking
$f(x) = \cos x$ and $g'(x) = e^x$. Then $f'(x) = -\sin x$ and $g(x) = e^x$. So

\[ \int e^x\cos x\,dx = e^x\cos x + \int e^x\sin x\,dx. \]

Putting the two results together, we get

\[ \int e^x\sin x\,dx = e^x\sin x - e^x\cos x - \int e^x\sin x\,dx, \]

and rearranging gives

\[ \int e^x\sin x\,dx = \tfrac{1}{2}e^x(\sin x - \cos x) + C. \]
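
Results obtained by integration by parts can be sanity-checked by comparing a numerical integral over some interval with the difference of the antiderivative at the endpoints. The sketch below is not part of the module text: it uses a hand-written composite Simpson's rule, and the interval [0, 1] and the helper names are arbitrary choices made here for illustration.

```python
import math


def simpson(f, a, b, n=1000):
    """Approximate the integral of f over [a, b] by composite Simpson's rule."""
    if n % 2:
        n += 1  # Simpson's rule needs an even number of subintervals
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3


# (name, integrand, antiderivative found by integration by parts)
checks = [
    ("x e^(-x)", lambda x: x * math.exp(-x),
     lambda x: -(x + 1) * math.exp(-x)),
    ("x^2 e^(-x)", lambda x: x ** 2 * math.exp(-x),
     lambda x: -(x ** 2 + 2 * x + 2) * math.exp(-x)),
    ("e^x sin x", lambda x: math.exp(x) * math.sin(x),
     lambda x: 0.5 * math.exp(x) * (math.sin(x) - math.cos(x))),
]

results = []
for name, integrand, antiderivative in checks:
    numeric = simpson(integrand, 0.0, 1.0)
    exact = antiderivative(1.0) - antiderivative(0.0)
    results.append((name, numeric, exact))
    print(f"{name:12s} numeric = {numeric:.10f}  from antiderivative = {exact:.10f}")
```

Agreement between the two columns does not prove the antiderivative is right, but a disagreement reliably signals an algebra slip.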


Solution to Exercise 43 


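(a) \[ \int_0^1 (x^3 - 2)\,dx = \left[\tfrac{1}{4}x^4 - 2x\right]_0^1 = (\tfrac{1}{4} - 2) - (0 - 0) = -\tfrac{7}{4}. \]

(b) \[ \int_1^2 (x^3 - 2)\,dx = \left[\tfrac{1}{4}x^4 - 2x\right]_1^2 = (4 - 4) - (\tfrac{1}{4} - 2) = \tfrac{7}{4}. \]

(c) \[ \int_0^2 (x^3 - 2)\,dx = \left[\tfrac{1}{4}x^4 - 2x\right]_0^2 = (4 - 4) - (0 - 0) = 0. \]

This integral is the sum of the integrals in parts (a) and (b).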


Solution to Exercise 44 


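\[ \int_0^{3/2} \frac{1}{9 + 4x^2}\,dx = \frac{1}{4}\int_0^{3/2} \frac{1}{\frac{9}{4} + x^2}\,dx = \frac{1}{4}\left[\frac{2}{3}\arctan\left(\frac{2x}{3}\right)\right]_0^{3/2} = \frac{1}{6}(\arctan 1 - \arctan 0) = \frac{1}{6}\left(\frac{\pi}{4} - 0\right) = \frac{\pi}{24}. \]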
Solution to Exercise 45 


Make the change of variable $u = 3\pi x$, so that $x = u/3\pi$ and

$du = (du/dx)\,dx = 3\pi\,dx$. Expressing the limits of integration in terms
of u, we see that the lower limit is u = 0, while the upper limit is $u = 3\pi$.
Hence 


= 7 
it 3 u=3T 
= (+) / u? sin? u du. 
37 u=0 


We can then use the given integral to conclude that 


ff 1) GP 3n) 1 1 
A 3r) 8 \ 8a 6 4) 6 3672 


Solution to Exercise 46 


(a) First notice that $f(x) = e^{-x^2}\sin(x)$ is an odd function of x because
\[ f(-x) = e^{-(-x)^2}\sin(-x) = -e^{-x^2}\sin(x) = -f(x). \]
Since the range of integration (from $-\pi$ to $+\pi$) is symmetric about the
origin, the integral vanishes.


(b) The integrand is odd because 
f(-a) = -asin(1 + (-2)*) =—f(2). 


Since the range of integration is symmetric about the origin, the 
integral vanishes. 


(c) We write the integral as 


1 1 
y ie cos(a’) dr + f xz cos(x?) dx. 


-1 -1 
The second integral vanishes since the integrand is odd and the range 
of integration is symmetric about the origin. 


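The first integral can be evaluated by changing the variable to $u = x^3$,
so that $du = 3x^2\,dx$, and the two limits become u = -1 and u = 1.
Hence

\[ \int_{-1}^{1} x^2\cos(x^3)\,dx = \frac{1}{3}\int_{u=-1}^{u=1} \cos u\,du = \left[\tfrac{1}{3}\sin u\right]_{u=-1}^{u=1} = \tfrac{2}{3}\sin(1) \simeq 0.561. \]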
u=—-1 




Acknowledgements 


Grateful acknowledgement is made to the following sources: 

Figure 1: NASA. 

Figure 2: Taken from www.astro.le.ac.uk/users/gwy /images/nano. 
Figure 23: Taken from http://wrongsideofmemphis.com/tag/kepler. 


Figure 24: M.V. Berry and S. Klein (1996) Coloured Diffraction 
Catastrophes, National Academy of Sciences. 


Figure 32(b): Image Editor / www.flickr.com. 


Every effort has been made to contact copyright holders. If any have been 
inadvertently overlooked, the publishers will be pleased to make the 
necessary arrangements at the first opportunity. 


Unit 2 


First-order differential equations 


Introduction 


This unit introduces differential equations. This topic is of central 
importance in physics because many physical laws are expressed as 
differential equations. Differential equations also appear in many other 
areas of science and applied mathematics. This introduction gives a brief 
outline of the topic. 


We begin by discussing the solution and derivation of a simple differential 
equation, namely 


\[ \frac{df}{dx} = -A f(x), \tag{1} \]

where A > 0 is a constant. This is called a differential equation because it
contains a derivative of an initially unknown function f(x). It is solved by
finding a function f(x) that satisfies the equation. You will see how it is
solved, and how it arises in two different physical situations. This
illustrates several important points about differential equations in quick
succession before the body of the unit discusses them at a slower pace.


In some cases differential equations can be solved by guessing the solution.
Noting that equation (1) is very similar to equation (19) of Unit 1, we try
the function $f(x) = \exp(-Ax)$. The derivative of this function is

\[ \frac{df}{dx} = -A\exp(-Ax). \]

However, we assumed that $\exp(-Ax) = f(x)$, so this shows that our guess
satisfies

\[ \frac{df}{dx} = -A f(x), \]

which is the differential equation that we wished to solve.


The same reasoning works if this trial solution is multiplied by any
constant: if $f(x) = B\exp(-Ax)$, then $df/dx = -AB\exp(-Ax) = -A f(x)$.
So we see that

\[ f(x) = B\exp(-Ax) \tag{2} \]


is a solution of the differential equation (1) for any value of B. The reason 
why the trial solution works in this case is easy to understand. The 
exponential function exp(x) has the property that when you differentiate 
it, you get the same function back again. The differential equation (1) 
describes a function with almost that property, so it is natural to seek a 
solution of this equation in the form of equation (2). 
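
The claim that $f(x) = B\exp(-Ax)$ satisfies equation (1) for any B can also be checked numerically. The following is a rough sketch rather than anything prescribed by the module: it approximates df/dx by a central difference, and the values of A and B, the sample points and the step size h are arbitrary choices made here.

```python
import math


def f(x, A, B):
    """Trial solution f(x) = B exp(-A x) of df/dx = -A f."""
    return B * math.exp(-A * x)


def central_difference(g, x, h=1e-6):
    """Approximate g'(x) by the symmetric difference quotient."""
    return (g(x + h) - g(x - h)) / (2 * h)


A = 0.7  # assumed positive constant in equation (1)
residuals = []
for B in (-2.0, 1.0, 3.5):          # several arbitrary multiplying constants
    for x in (0.0, 0.5, 2.0):       # several arbitrary sample points
        lhs = central_difference(lambda s: f(s, A, B), x)
        rhs = -A * f(x, A, B)
        residuals.append(lhs - rhs)

# The residual df/dx + A f should be zero up to finite-difference error.
print("largest residual:", max(abs(r) for r in residuals))
```

The residual is small but not exactly zero, because the derivative is only approximated; what matters is that it stays negligible for every choice of B.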


You have now seen an example of a differential equation and its solution. 
But how does this differential equation arise in applications? We will 
discuss two examples. 


Equation (19) of Unit 1 is

\[ \frac{d\exp(x)}{dx} = \exp(x). \]


Figure 1 Intensity of light I
in the sea at depth x


First, we consider a model for light penetrating into the ocean depths. 
Sunlight is absorbed in seawater, and almost no light reaches the deepest 
parts of the oceans. If you were involved with underwater engineering or 
investigating the behaviour of plankton, which can adjust their depth 
depending on the light intensity, you might wish to understand how the 
light intensity decreases as the depth increases. Let the intensity of light 
be I(x) at depth x below the surface (Figure 1). 


We would like to be able to find the function I(x). This is done by finding 
a differential equation satisfied by I(x). The differential equation is found 
using the following argument. 


Consider what happens as we go from a depth x, passing through a thin
layer of seawater of thickness $\delta x$ to a slightly larger depth $x + \delta x$. (It is
conventional to write the small change in any quantity X as $\delta X$, where $\delta$ is
the Greek letter 'delta'.) On passing through this thin layer, the intensity
of light changes by an amount $\delta I$, due to absorption of light. This change
is negative because the light intensity is reduced. We might reasonably
expect that the amount of light absorbed is proportional to the amount of
light entering the layer. Also, the amount of light absorbed by a thin layer
is expected to be proportional to the thickness of the layer, so

\[ \delta I = -A I\,\delta x, \]

where A is a constant (called the absorption coefficient) that describes
how effectively seawater absorbs light.

Now divide both sides of this equation by $\delta x$, to obtain

\[ \frac{\delta I}{\delta x} = -A I. \]

It is assumed that $\delta x$ is small. If we take the limit as $\delta x$ tends to zero, we
get a differential equation for I(x):

\[ \frac{dI}{dx} = -A I(x). \tag{3} \]

This is the same as the equation we have already solved, apart from a
change in the name of the function from f(x) to I(x). The solution is
therefore

\[ I(x) = B\exp(-Ax), \]


for some constant B. This solution is plotted in Figure 2 for A = B = 1. It
has the expected property that the intensity I decreases as the depth x
increases. The decrease is rapid in the upper layers but more gradual in
the murky lower depths. A function of this type is said to exhibit
exponential decay.


Figure 2 The graph of $I(x) = B\exp(-Ax)$ for A = B = 1


Notice that a complicated situation has been made much simpler by 
considering a very thin layer, in which the absorption of light can be taken 
to be proportional to the thickness of the layer. This approximation is 
valid for a thin layer but not for a thick one (as shown by the non-linear 
shape of the graph in Figure 2). The beauty of this approach emerges 
when we imagine the layer becoming infinitesimally thin. In this limit, the 
approximation becomes exact and, at the same time, we obtain an 
equation involving derivatives, that is, a differential equation. So here is 
mathematical alchemy — we start with an approximation that is reasonable 
only on a small scale, and use it to obtain a differential equation that 
avoids this approximation and has solutions that apply on a large scale. 
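
The way the thin-layer approximation becomes exact in the limit can be seen numerically. The sketch below is an illustration under assumptions made here, not a method from the module: it splits a depth into n equal layers, applies $\delta I = -A I\,\delta x$ to each layer in turn, and compares the result with $B\exp(-Ax)$. The values A = 1, B = 1 and depth 2 are arbitrary.

```python
import math


def intensity_by_layers(depth, A, I0, n_layers):
    """Apply delta_I = -A * I * delta_x across n_layers equal thin layers."""
    dx = depth / n_layers
    intensity = I0
    for _ in range(n_layers):
        intensity += -A * intensity * dx
    return intensity


A, I0, depth = 1.0, 1.0, 2.0          # arbitrary illustrative values
exact = I0 * math.exp(-A * depth)     # solution of dI/dx = -A I with I(0) = I0

errors = []
for n in (10, 100, 1000, 10000):
    approx = intensity_by_layers(depth, A, I0, n)
    errors.append(abs(approx - exact))
    print(f"{n:6d} layers: I = {approx:.6f}   exact = {exact:.6f}")
```

With only ten layers the approximation is visibly off, while with thousands of layers it agrees with the exponential to several decimal places, which is the limiting process described above.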


Figure 3 An anglerfish creates light to attract its prey


Radioactive decay provides another context in which differential
equation (1) appears. In this case, atoms of a radioactive substance
'decay'; that is, they release radiation and are transformed into other types
of atom.


The term 'continuous variable'
here means that $N \in \mathbb{R}$. We are
also assuming that N is large
and positive.


Figure 4 Skull of a smilodon


At time t, a given piece of radioactive material contains N ‘parent’ atoms 
(where N is a very large number). Because the number of atoms is very 
large, we will treat N as a continuous variable, rather than an integer. The 
radioactive ‘parent’ atoms are unstable and can break up to produce a 
‘daughter’ atom and an energetic particle. By considering small intervals 
of time $\delta t$, it can be shown that the differential equation for the number of
parent atoms as a function of time is

\[ \frac{dN}{dt} = -AN, \]
where A is a constant that depends on the parent atom type. This is the 
same equation as equation (1), with different names for the symbols, and 
the solution can be found from equation (2) by changing the names of the 
symbols. The solution is therefore 


N(t) = Bexp(—At). 
Setting t = 0, we get N(0) = B, so we identify the constant B with the
initial number of parent atoms, i.e. the number present at time t = 0,
which we denote by $N_0$. Hence

\[ N(t) = N_0\exp(-At). \tag{4} \]


The constant A is called the decay constant. It is large for types of parent
atom that decay rapidly. The decay rate is often expressed in terms of the
half-life of the atoms, $T_{1/2}$, which is the time taken for half of the parent
atoms to decay. To determine the relation between the decay constant and
the half-life, set

\[ \frac{1}{2} = \frac{N(T_{1/2})}{N_0} = \exp(-AT_{1/2}). \]

Taking the logarithm of this gives $\ln(1/2) = -AT_{1/2}$, so

\[ T_{1/2} = \frac{\ln 2}{A}. \]


Radiocarbon dating 


Atoms of carbon-14 are produced in the upper atmosphere by 
collisions between cosmic rays and molecules. They are unstable, and 
decay with a half-life of approximately 5730 years. Carbon-14 is
absorbed by an organism when it is alive, and decays after the 
organism has died. By measuring the ratio of the carbon-14 atoms in 
a dead organism to those in a living one, the number of years since 
death can be estimated, using equation (4). For example, Figure 4 
shows the skull of a smilodon (a sabre-toothed cat that was larger 
than a modern tiger). Radiocarbon dating of specimens like this 
reveals that such cats died about 10000 years ago. 
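
The dating calculation amounts to inverting equation (4): if a fraction $N/N_0$ of the parent atoms remains, then $t = -\ln(N/N_0)/A$ with $A = \ln 2/T_{1/2}$. The sketch below illustrates this under stated assumptions: the function names are invented here, the half-life is passed in as a parameter, and the value 5730 years used in the example is the commonly quoted figure for carbon-14, assumed for this sketch.

```python
import math


def decay_constant(half_life):
    """Decay constant A from the half-life, using T_half = ln(2) / A."""
    return math.log(2) / half_life


def fraction_remaining(t, half_life):
    """N(t) / N0 = exp(-A t), from equation (4)."""
    return math.exp(-decay_constant(half_life) * t)


def age_from_fraction(fraction, half_life):
    """Invert equation (4): elapsed time for a given remaining fraction N / N0."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must lie in (0, 1]")
    return -math.log(fraction) / decay_constant(half_life)


half_life_years = 5730.0  # assumed carbon-14 half-life (commonly quoted value)

# Fraction of carbon-14 expected to remain after 10000 years.
remaining = fraction_remaining(10000.0, half_life_years)
print("fraction remaining after 10000 years:", round(remaining, 3))

# Recover the age from that fraction as a consistency check.
print("age recovered from that fraction    :",
      round(age_from_fraction(remaining, half_life_years)))
```

Taking the half-life as a parameter keeps the sketch usable for other isotopes; only the ratio of elapsed time to half-life affects the remaining fraction.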




These examples of light intensity and radioactive decay illustrate some 
significant points. 


e The two examples gave rise to essentially the same differential 
equation, just with different symbols. You need to be able to recognise 
when a differential equation is of the same form as one that you have 
seen before. 


e In each case, the differential equation was obtained by considering 
small changes in physical variables ($\delta x$ and $\delta t$). Similar steps are used
in the derivation of many differential equations. However, you will not 
be asked to derive differential equations in this module; instead, we 
will focus on ways of solving them. 


e The solution was obtained by guessing a suitable function. Judicious 
guesswork can help us to solve some differential equations, but you 
will see that systematic methods allow us to solve many more. 


Study guide 


This unit introduces some basic ideas about differential equations, before 
considering in detail first-order differential equations, so called because 
they involve only derivatives of first order. 


Section 1 gives an example of how a differential equation arises in a 
mathematical model. It also introduces some basic definitions and 
terminology associated with differential equations and their solutions. 


Sections 2 and 3 develop exact methods for solving first-order differential 
equations of various special types. From the point of view of later studies, 
these sections contain the most important material in this unit, which will 
be built on later in the module. 


Some integrals cannot be evaluated by hand. In a similar way, many 
differential equations cannot be solved exactly. This unit therefore ends by 
looking at qualitative methods in Section 4 and numerical methods in 
Section 5. These sections will not be tested in continuous assessment or in 
the exam, but it is important that you read them in order to gain an 
insight into what progress can be made when exact solutions of differential 
equations lie beyond our grasp. 


1 Some basics 


The Introduction gave a quick and sketchy overview of the subject of 
differential equations in order to give you some feeling for the scope and 
significance of this vast topic. Here we start again from the beginning, 
developing the subject at a slower pace. 


An important class of equations that arises in mathematics consists of 
those that feature the rates of change of one or more variables with respect 
to one or more others. These rates of change are expressed mathematically 
by derivatives, and the corresponding equations are called differential 
equations. Equations of this type crop up in a wide variety of situations. 
They are found, for example, in models of physical, astronomical, 
electronic, economic, demographic and biological phenomena. 


First-order differential equations, which are the particular topic of this 
unit, feature derivatives of order one only; that is, if the rate of change of 
variable y with respect to variable x is involved, the equations feature 
dy/dx, but not $d^2y/dx^2$, $d^3y/dx^3$ or any higher-order derivatives. In many
cases we consider the rate of change of a variable with respect to time, but 
we also discuss differential equations where the independent variable is 
position, or occasionally some other variable. 


When a differential equation arises, it is usually an important aim to solve 
the equation. For an equation that features the derivative dy/dx, this
entails expressing the dependent variable y directly in terms of the 
independent variable x. The process of solution requires the effect of the 
derivative to be ‘undone’. The reversal of differentiation is achieved by 
integration, so it is to be expected that integration will feature 
prominently in the methods for solving differential equations. The solution 
can be attempted symbolically, to get an exact formula, or numerically, to 
get approximate numerical values that can be tabulated or graphed. 


Subsection 1.1 describes a situation that leads naturally to a first-order 
differential equation — in this case, one that is slightly more complex than 
equation (1). 


Subsection 1.2 then introduces some important terminology. It explains 
what is meant by the term ‘solution’ in the context of first-order 
differential equations, and brings out the distinction between the general 
solution and the various possible particular solutions. The specification of 
a constraint (in the form of an initial condition) usually allows us to find a 
unique function that is a particular solution of the differential equation 
and also satisfies the constraint. 


1.1 Where do differential equations come from? 


To illustrate how differential equations arise, we consider an example 
drawn from biology. 


Suppose that we are interested in the size of a particular population, and 
in how it varies over time. The first point to make is that the size of any 
population is measured in integers (whole numbers), so it is not clear how 
differentiation will be relevant. Nevertheless, if the population is large, say 
in the hundreds of thousands, a change of one unit will be relatively very 
small, and in these circumstances we may choose to model the population 
size by a continuous function of time. This function can be written as 
P(t), and our task is to show how P(t) may be described by a differential 
equation. 




Let us assume a fixed starting time (which we label t = 0). If the 
population is not constant, then there will be ‘joiners’ and ‘leavers’. For 
example, in a population of humans in a particular country, the former are 
those who are born, or who immigrate into the country, while the latter 
are those who die or emigrate. For our simple model we will ignore 
immigration and emigration, and concentrate solely on births and deaths. 


In the small period between t and $t + \delta t$, it is reasonable, as a first
approximation, to expect the number of births to be proportional to the
population size P(t) at time t, and to the time interval $\delta t$. A similar
argument applies to the number of deaths, so we can write

\[ \text{number of births} \simeq b\,P(t)\,\delta t, \]

\[ \text{number of deaths} \simeq c\,P(t)\,\delta t, \]

where b and c are positive constants known as the proportionate birth rate
and the proportionate death rate, respectively.

The change $\delta P$ in the population over the time interval $\delta t$ is the number of
births minus the number of deaths in that interval. So we have

\[ \delta P \simeq b\,P(t)\,\delta t - c\,P(t)\,\delta t = (b - c)\,P(t)\,\delta t. \]

Births appear with a positive sign because they add to the
population; deaths appear with a negative sign because they
subtract from the population.

Dividing through by $\delta t$, we obtain

\[ \frac{\delta P}{\delta t} \simeq (b - c)\,P(t). \]

The approximations involved in deriving this equation become
progressively more accurate for shorter time intervals. So, finally, by
letting $\delta t$ tend to zero, we obtain

\[ \frac{dP}{dt} = (b - c)\,P(t). \tag{5} \]

This is the step that requires P to be a continuous (rather than
discrete) function of t.
t 


This is a differential equation because it describes dP/dt rather than the 
eventual object of our interest (the function P(t) itself). 


We can simplify the above equation by introducing the proportionate
growth rate r, which is defined to be the difference between the
proportionate birth and death rates: r = b - c. Then our model becomes

\[ \frac{dP}{dt} = rP, \tag{6} \]

where r is a constant (which may be positive, negative or zero).


Equation (6) may look familiar: it is essentially the same as the equation 
discussed in the Introduction (the only difference being the sign on the 
right-hand side). So it should not surprise you that the solution is of the 
form 


\[ P(t) = P_0\exp(rt), \tag{7} \]

where $P_0$ is the initial population at time t = 0.


Figure 6 A plot of r(P) as
given by equation (8)


For r > 0, this leads to the prediction of an exponential growth in 
population; for r < 0, it predicts an exponential decay in population; and 
for r = 0, the population is predicted to remain constant. These three 
possibilities are sketched in Figure 5. 


Figure 5 Population models obtained from equation (6), where r is a
constant: (a) r > 0, (b) r < 0, (c) r = 0


From equation (6), we see that the proportionate growth rate r is given by 


\[ r = \frac{1}{P}\frac{dP}{dt}, \]
and up to this point, we have assumed this to be constant. However, this 
is not entirely realistic. For example, if r is a positive constant, then an 
exponential increase in population may be sustained for a while, but we 
know that such growth cannot go on forever. When the population is low, 
we may assume that there is the potential for it to grow (assuming a 
favourable environment) and the proportionate growth rate r should be 
high. But when the population becomes too large, competition for basic 
resources such as food is bound to restrict further growth, and the 
proportionate growth rate will be lower. We therefore arrive at the 
conclusion that r will be a function of P, decreasing as P increases. This 
decline in the proportionate growth rate ensures that unlimited 
exponential growth will not occur. 


A particularly useful model arises from taking r(P) to be a decreasing 
linear function of P, as shown in Figure 6. We write this as 


\[ r(P) = k\left(1 - \frac{P}{M}\right), \tag{8} \]


where k and M are positive constants. From this formula, you can see that 
the proportionate growth rate r is modelled as decreasing linearly with P, 
from the value k (when P = 0) to the value 0 (when P = M). For P > M,
the proportionate growth rate is negative. 


Using this expression for r, the differential equation satisfied by P becomes 


\[ \frac{dP}{dt} = kP\left(1 - \frac{P}{M}\right). \tag{9} \]
This is well known to biologists as the logistic equation. You will see 
how to solve this equation exactly in Section 2, and how to analyse its 
qualitative behaviour in Section 3. 
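
The behaviour that equation (9) is designed to capture can be previewed numerically. The following is a minimal sketch, not the method used in this unit: it steps the logistic equation forward with Euler's method. The values of k and M, the two starting populations, the step count and the function names are all illustrative assumptions.

```python
def logistic_rhs(P, k, M):
    """Right-hand side of equation (9): dP/dt = k P (1 - P/M)."""
    return k * P * (1 - P / M)


def euler(P0, k, M, t_end, steps):
    """Crude numerical solution by Euler's method (illustration only)."""
    dt = t_end / steps
    P = P0
    for _ in range(steps):
        P += dt * logistic_rhs(P, k, M)
    return P


# Hypothetical parameter values, chosen only for illustration.
k, M = 0.5, 1000.0

# One run starts well below M, the other starts above M.
low = euler(10.0, k, M, t_end=40.0, steps=40000)
high = euler(1500.0, k, M, t_end=40.0, steps=40000)
print(low, high)
```

With these assumed values, both runs settle close to M: the population grows while P < M (positive proportionate growth rate) and shrinks while P > M (negative proportionate growth rate).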


1 Some basics 


It is worth noting that not all differential equations are derived like those 
above, from first principles using small increments. In science, many 
differential equations come from physical laws that are framed directly in 
terms of the rates of change of physical quantities. For example, Newton’s 
second law tells us that 
\[ F = m\,\frac{d^2x}{dt^2}, \]
where F is the force acting on a particle of mass m, and $d^2x/dt^2$ is the
acceleration of the particle in the direction of the force. This law produces 
a large variety of differential equations whose precise form depends on the 
nature of the force. 


The Voyager missions 


A spectacular example of the use of Newton’s second law is illustrated 
in Figure 7, which shows the trajectories of the two Voyager 
spacecraft, launched in 1977. 


Figure 7 The trajectories of the two Voyager spacecraft were
planned by solving differential equations (the figure labels the launch
and planetary encounter dates, from 1977 to 1989)


These trajectories were carefully designed to enable the spacecraft to
pass close to various planets and eventually to escape from the Solar
System. The trajectories were planned by solving Newton’s second
law for the position and velocity of each spacecraft. The resulting
differential equations are complicated by the fact that the forces
depend on the positions of the planets, which depend on time. Such
equations are much more complicated than the logistic equation, and
in practice are solved on computers using numerical methods.


Figure 8 shows a fruit of these labours: close-up views of Saturn and
its moons. The moon in the foreground is Dione, which is composed
mainly of water ice.


Figure 8 A montage of images taken by the two Voyager spacecraft, showing
Saturn, its rings and six of its moons


Exercise 1


Suppose that a population obeys the logistic equation (with a
proportionate growth rate given by equation (8)), and that you are given
the following information. When P = 10, the proportionate growth rate
is 1, and when P = 10000, the proportionate growth rate is 0. Find the
corresponding values of k and M.


1.2 Differential equations and their solutions


This subsection introduces some of the fundamental concepts associated
with differential equations. First, however, you are asked to recall some
terminology and notation from your previous exposure to calculus. (Some of
this terminology and notation was discussed in Unit 1.)


The derivative of a variable y with respect to another variable x is denoted 
in Leibniz notation by dy/dx. In this expression we refer to y as the 
dependent variable and x as the independent variable. 


Other notations are also used for derivatives. If the relation between
variables x and y is expressed in terms of a function f, so that y = f(x),
then the derivative may be written in function notation as $f'(x)$. A further
notation, attributed to Newton, is restricted to cases in which the
independent variable is time, denoted by t. The derivative of y = f(t) can
be written in this case as $\dot{y}$, where the dot over the y stands for the d/dt of
Leibniz notation.


Thus we may express this derivative in any of the equivalent forms


\[ \frac{dy}{dt} = \dot{y} = f'(t). \]


Further derivatives are obtained by differentiating this first derivative. The 
second derivative of y = f(t) could be represented by any of the forms 

\[ \frac{d^2y}{dt^2} = \ddot{y} = f''(t). \]

These possible notations have different strengths and weaknesses, and
which is most appropriate in any situation depends on the purpose at hand.
You will see all of these notations used at various times during the module.


It is common practice in science and applied mathematics to reduce the 
proliferation of symbols as far as possible. One aspect of this practice is 
that we often avoid allocating separate symbols to variables and to 
associated functions. So in place of the equation y = f(t) (where y and t 
denote variables, and f denotes the function that relates them), we write 


y = y(t), which is read as ‘y is a function of t’. (You saw an example of
this in the preceding subsection, where we used P and P(t).) Strictly
speaking, this is an abuse of notation, since there is ambiguity as to
exactly what the symbol y represents: it is a variable on the left-hand side
of y = y(t), but a function on the right-hand side. However, this is a very
convenient abuse!


The following definitions explain just what is meant by a differential
equation, by the order of such an equation, and by a solution of it.


Definitions 

• A differential equation for y = y(x) is an equation that relates
the independent variable x, the dependent variable y, and one or
more derivatives of y with respect to x.


• The order of a differential equation is the order of the highest
derivative that appears in the equation. Thus a first-order
differential equation for y = y(x) features only the first
derivative, dy/dx.


• A solution of a differential equation is a function y = y(x) that
satisfies the equation.


These definitions have been framed in terms of an independent variable x 
and a dependent variable y. You should be able to translate them to apply 
to any other independent and dependent variables. Thus equation (9) is a 
differential equation in which t is the independent variable and P is the 
dependent variable. It is a first-order equation, since dP/dt appears in it 
but higher derivatives such as $d^2P/dt^2$ do not. By contrast, the differential
equation
d?y dy 2 2 
335 4+2— sinx =x 
dx? + dx aie 


is of second order, since the second derivative $d^2y/dx^2$ appears in it but
higher derivatives do not. Second-order differential equations will be 
discussed in Unit 3. 


The topic of this unit is first-order differential equations. Moreover, it 
concentrates on first-order equations that can be expressed (possibly after 
some algebraic manipulation) in the form 

\[ \frac{dy}{dx} = f(x, y). \tag{10} \]
(Equation (9) is of this form, with $f(t, P) = kP\,(1 - P/M)$.)
The right-hand side here stands for an expression involving both, either or
neither of the variables x and y, but no other variables and no derivatives.


According to the definition given above, a function has to satisfy a
differential equation to be regarded as a solution of it. The differential
equation is satisfied by the function provided that when the function is
substituted into the equation, the left- and right-hand sides of the equation
give identical expressions. (This substitution includes the requirement that
the function should be differentiable, i.e. that it should have a derivative,
at all points where it is claimed to be a solution.)


In the next exercise you are asked to verify that several functions are
solutions of corresponding first-order differential equations. Later in the
unit, you will see how all of these differential equations may be solved; but
even when a solution has been deduced, it is worth checking by
substitution that the supposed solution is indeed valid.


Exercise 2


Verify that each of the following functions is a solution of the
corresponding differential equation.


(a) $y = 2e^{x} - (x^2 + 2x + 2)$; $\dfrac{dy}{dx} = y + x^2$.

(c) $u = 2e^{x^2/2}$; $u' = xu$.

(d) $y = \sqrt{\dfrac{27 - x^2}{3}}$; $\dfrac{dy}{dx} = -\dfrac{x}{3y}$ $(y \neq 0)$.

(e) $y = t + e^{-t}$; $\dot{y} = -y + t + 1$.

(f) $y = t + Ce^{-t}$; $\dot{y} = -y + t + 1$. (Here C is an arbitrary constant.)

(The restriction y ≠ 0 placed on the differential equation in part (d) is
necessary to ensure that −x/(3y) is well defined.)


In the last two parts of Exercise 2 you were asked to verify that
\[ y = t + e^{-t} \quad\text{and}\quad y = t + Ce^{-t} \]
are solutions of the differential equation $\dot{y} = -y + t + 1$, where in the
second case C is an arbitrary constant. Whatever number is chosen for C,
the corresponding expression for y(t) is always a solution of the differential
equation. The particular function $y = t + e^{-t}$ is just one example of such a
solution, obtained by choosing C = 1.
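
The check by substitution can also be carried out numerically. The sketch below is an illustration only, not a replacement for the algebraic verification: it estimates the derivative of $y = t + Ce^{-t}$ by a central difference and compares the two sides of $\dot{y} = -y + t + 1$. The sample values of C and t, the step h and the function names are assumptions made for the example.

```python
import math


def y(t, C):
    """Candidate solution y = t + C e^(-t)."""
    return t + C * math.exp(-t)


def residual(t, C, h=1e-5):
    """Left-hand side minus right-hand side of y' = -y + t + 1,
    with y' estimated by a central difference of step h."""
    dydt = (y(t + h, C) - y(t - h, C)) / (2 * h)
    return dydt - (-y(t, C) + t + 1)


# Sample several values of the arbitrary constant C and of t.
worst = max(
    abs(residual(t, C))
    for C in (-2.0, 0.0, 1.0, 3.5)
    for t in (0.0, 0.5, 1.0, 2.0)
)
print(worst)  # tiny: limited only by the finite-difference approximation
```

The residual stays negligibly small for every sampled C, in line with the statement that any choice of C gives a solution.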


This demonstrates that solutions of a differential equation can exist in 
profusion; as a result, we need terms to distinguish between the totality of 
all the solutions for a given equation and an individual solution that is just 
one of the possibilities. 


Definitions 


• The general solution of a differential equation is the collection
of all possible solutions of that equation.


• A particular solution of a differential equation is a single
solution of the equation, and consists of a solution containing no
arbitrary constants.


In many cases it is possible to describe the general solution of a first-order
differential equation by a single formula involving one arbitrary constant.
For example, $y = t + Ce^{-t}$ is the general solution of the equation
$\dot{y} = -y + t + 1$; this means that not only is $y = t + Ce^{-t}$ a solution for all
values of C, but also every particular solution of the equation may be
obtained by giving C a suitable value.


Sometimes the values allowed for the arbitrary constant are restricted in
some way. In the above example, if y is real-valued, we should take C to
be a real (rather than a complex) number. In other cases, the general
solution makes sense only if the arbitrary constant C is restricted to some
range, and you will meet examples of this later in this unit. Nevertheless,
any arbitrary constant is ‘arbitrary’ in the sense that it does not have a
definite value.


Exercise 3 


(a) Verify that, for any value of the constant C, the function
$y = C - \tfrac{1}{3}e^{-3x}$ is a solution of the differential equation

\[ \frac{dy}{dx} = e^{-3x}. \]

(b) Verify that, for any value of the constant C, the function
$u = Ce^{t} - t - 1$ is a solution of the differential equation

\[ \dot{u} = t + u. \]

(c) Verify that, for any value of the constant C, the function

\[ P = \frac{CMe^{kt}}{1 + Ce^{kt}} \]

is a solution of equation (9).


As you have seen, there are many solutions of a differential equation. 
However, a particular solution of the equation, representing a definite 
relationship between the variables involved, is often what is needed. This 
is obtained by using a further piece of information in addition to the 
differential equation. Often the extra information takes the form of a pair 
of values for the independent and dependent variables. 


For example, in the case of a population model, it would be natural to
specify the starting population, $P_0$ say, and to start measuring time from
t = 0. We could then write

\[ P = P_0 \text{ when } t = 0, \quad\text{or equivalently,}\quad P(0) = P_0. \]


A requirement of this type is called an initial condition. 




Definitions 


• An initial condition associated with the differential equation

\[ \frac{dy}{dt} = f(t, y) \]

specifies that the dependent variable y takes some value $y_0$ when
the independent variable t takes some value $t_0$. This is written
either as

\[ y = y_0 \text{ when } t = t_0 \]

or as

\[ y(t_0) = y_0. \]

The numbers $t_0$ and $y_0$ are referred to as initial values.


• The combination of a first-order differential equation and an
initial condition is called an initial-value problem.


The word ‘initial’ in these definitions arises from those (frequent) cases in 
which: 


• the independent variable represents time t, and the differential
equation describes how a system evolves in time


• the initial condition $y(t_0) = y_0$ describes the way the system is started
off at some initial time $t_0$


• we are interested in how the system behaves at times after the initial
time $t_0$, so we want the solution y(t) for $t > t_0$.


However, this situation is not essential. The term ‘initial condition’ is used
even when we are interested in the solution y(t) for $t < t_0$ or when the
independent variable does not represent time.


We usually expect that an initial-value problem should have a unique 
solution, since the outcome is then completely determined by the 
differential equation that governs the system, and by the configuration 
that the system had at the start. In fact, all the initial-value problems you 
will meet in this module have unique solutions, so if you can find a solution 
to an initial-value problem, then this is the solution. 


Example 1 


Using the result given in Exercise 3(b), find a solution to the initial-value
problem

\[ \frac{dy}{dx} = x + y, \quad y(0) = 1. \]


Solution


From Exercise 3(b), on replacing the variables t, u by x, y, the differential
equation in this question has solutions of the form

\[ y = Ce^{x} - x - 1. \]

The initial condition tells us that y = 1 when x = 0, and on feeding these
values into the above solution we find that

\[ 1 = Ce^{0} - 0 - 1 = C - 1. \]

Hence C = 2, and the particular solution of the differential equation that
solves the initial-value problem is

\[ y = 2e^{x} - x - 1. \]
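
The step of fixing the arbitrary constant from an initial condition can be sketched in a few lines. This is an illustration under stated assumptions rather than a general-purpose routine: it handles only the family $y = Ce^{x} - x - 1$ from Exercise 3(b), and the function names are hypothetical.

```python
import math


def general_solution(x, C):
    """Family y = C e^x - x - 1 (Exercise 3(b), written in terms of x and y)."""
    return C * math.exp(x) - x - 1


def constant_from_initial_condition(x0, y0):
    """Solve y0 = C e^(x0) - x0 - 1 for the constant C."""
    return (y0 + x0 + 1) * math.exp(-x0)


# Initial condition of Example 1: y = 1 when x = 0.
C = constant_from_initial_condition(0.0, 1.0)
print(C)                         # the constant found in Example 1
print(general_solution(0.0, C))  # reproduces the initial value
```

The same two-step pattern (write down the general solution, then solve one algebraic equation for C) applies to any initial condition $y(x_0) = y_0$ for this family.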


Exercise 4 


The size P of a population (measured in hundreds of thousands) is 
modelled by the logistic equation 


\[ \frac{dP}{dt} = kP\left(1 - \frac{P}{M}\right), \quad P(0) = 1, \]

where k = 0.15, M = 10, and t is time measured in years.


(a) Use the result given in Exercise 3(c) to find a solution to this 
initial-value problem. 


(b) Use your answer to predict the long-term behaviour of the population
size (as t → ∞).


Finally in this subsection, note that we sometimes need to keep an eye on 
the domain of the function solving the differential equation. ‘Gaps’ in the 
domain usually show up as some form of restriction on the nature of a 
solution curve. For example, consider the differential equation 
\[ \frac{dy}{dx} = \frac{1}{x}. \tag{11} \]
It turns out that there are two distinct families of solutions, illustrated in
Figure 9. One family of solutions (given by y = ln x + C) applies for x > 0,
and another family (given by y = ln(−x) + C) applies for x < 0. The
right-hand side of the differential equation is not defined for x = 0, so there
is no solution there, and the two families do not cross the y-axis. (Since
|x| = −x if x < 0, you can see that this agrees with what we know from
Unit 1, namely $\int \frac{1}{x}\,dx = \ln|x| + C$.)


Figure 9 The two families of solutions, for x > 0 and x < 0, arising from
equation (11)


2 Direct integration and separation of 
variables 


This section and the next look at methods for finding analytic solutions of 
first-order differential equations — that is, solutions expressed in terms of 
exact formulas. 


Until now we have considered first-order differential equations of the form 

\[ \frac{dy}{dx} = f(x, y). \tag{Eq. 10} \]
We now specialise to two important cases where analytic solutions can be
found. Each case involves a special technique. The method of direct
integration applies to first-order differential equations of the form

\[ \frac{dy}{dx} = g(x), \tag{12} \]
where the right-hand side is a function of x alone. The method of
separation of variables applies to first-order differential equations of the
form

\[ \frac{dy}{dx} = g(x)\,h(y), \tag{13} \]

where the right-hand side is the product of a function of x and a function
of y. Of course, equation (12) is a special case of equation (13) with
h(y) = 1. Nevertheless, it is convenient to discuss equation (12) first
because it is solved in a very straightforward way.


2.1 Direct integration 


An example of a differential equation that can be solved by direct
integration is

\[ \frac{dy}{dx} = x^2. \tag{14} \]

In order to solve this equation, we need to find functions y(x) whose
derivatives are $x^2$; one such function is $y = \tfrac{1}{3}x^3$. There are other functions
with this same derivative, for example, $y = \tfrac{1}{3}x^3 + 1$ and $y = \tfrac{1}{3}x^3 - 2$. In
fact, any function of the form

\[ y = \tfrac{1}{3}x^3 + C, \tag{15} \]

where C is an arbitrary constant, satisfies differential equation (14), and
equation (15) is the general solution of this differential equation. (The
values C = 0, C = 1 and C = −2 give the three particular solutions
mentioned above.)


The expression $\tfrac{1}{3}x^3 + C$ is also the indefinite integral of $x^2$: that is,

\[ \int x^2\,dx = \tfrac{1}{3}x^3 + C. \]

In this case, therefore, the indefinite integral of $x^2$ is the general solution of
differential equation (14), and a similar connection applies more generally,
as you will now see. (This is hardly surprising, since integration ‘undoes’ or
reverses the effect of differentiation.)


If we have a differential equation of the form

\[ \frac{dy}{dx} = f(x), \tag{16} \]

where the right-hand side, f(x), is a known function of x alone, then we
simply integrate both sides with respect to x:

\[ \int \frac{dy}{dx}\,dx = \int f(x)\,dx. \]

(The function f(x) is assumed to be continuous, i.e. its graph has no breaks.)
The left-hand side gives y, so we get

\[ y = \int f(x)\,dx = F(x) + C, \tag{17} \]

where F(x) is an integral of f(x), and C is an arbitrary constant.


This means that the general solution of equation (16) can be written down
directly as an indefinite integral; and if the integration can be performed,
then the equation is solved.


Procedure 1 Direct integration


The general solution of the differential equation

\[ \frac{dy}{dx} = f(x) \tag{Eq. 16} \]
is
\[ y = \int f(x)\,dx = F(x) + C, \tag{Eq. 17} \]

where F(x) is an integral of f(x), and C is an arbitrary constant.


Once the general solution has been found, it is possible to single out a
particular solution by specifying a value for the constant C. This value can
be found by applying an initial condition.




Example 2 


(a) Find the general solution of the differential equation

\[ \frac{dy}{dx} = e^{-3x}. \]

(b) Find the particular solution of this differential equation that satisfies
the initial condition $y(0) = \tfrac{5}{3}$.


Solution


(a) On applying direct integration, we obtain the general solution

\[ y = \int e^{-3x}\,dx = -\tfrac{1}{3}e^{-3x} + C, \]

where C is an arbitrary constant.


(b) The initial condition $y(0) = \tfrac{5}{3}$ tells us that $y = \tfrac{5}{3}$ when x = 0, so we
must have

\[ \tfrac{5}{3} = -\tfrac{1}{3}e^{0} + C = -\tfrac{1}{3} + C, \]

which gives C = 2. The required particular solution is therefore

\[ y = -\tfrac{1}{3}e^{-3x} + 2. \]


Procedure 1 uses x for the independent variable and y for the dependent
variable. As usual, you should be prepared to translate this into situations
where other symbols are used for the variables. But remember that the
method of direct integration applies only to first-order differential
equations for which the derivative is equal to a function of the independent
variable alone. Thus direct integration can be applied, for example, to the
differential equation

\[ \frac{dx}{dt} = \cos t, \]
which has t as the independent variable and x as the dependent variable.
In this case, the general solution is

\[ x = \int \cos t\,dt = \sin t + C, \]

where C is an arbitrary constant. On the other hand, the differential
equation

\[ \frac{dx}{dt} = x^2 \]
cannot be solved by direct integration because the right-hand side here is a
function of the dependent variable, x.
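
When the right-hand side depends on the independent variable alone, the solution through a given starting point can also be computed by numerical integration, since x(t) = x(0) + ∫ cos s ds taken from 0 to t. The sketch below illustrates this for dx/dt = cos t using composite Simpson's rule. The initial condition x(0) = 1 is a hypothetical choice made for the example (it corresponds to C = 1 in x = sin t + C), and the function names are likewise assumptions.

```python
import math


def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def x_numeric(t, t0=0.0, x0=1.0):
    """x(t) = x(t0) + integral of cos from t0 to t (direct integration)."""
    return x0 + simpson(math.cos, t0, t)


def x_exact(t):
    """General solution x = sin t + C with C = 1 (hypothetical x(0) = 1)."""
    return math.sin(t) + 1.0


gap = max(abs(x_numeric(t) - x_exact(t)) for t in (0.0, 0.5, 1.0, 2.0, 3.0))
print(gap)  # very small: the two agree to within the quadrature error
```

The same approach would not work for an equation whose right-hand side involves the dependent variable, because the integrand would then depend on the unknown solution itself.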




Exercise 5 


Solve each of the following initial-value problems. 


(a) “ =te, Glj=—s: 


dv 4 
—=e = 2: 


(c) $\dot{y} = 5\sin 2t$, $y(0) = 0$.


The method of direct integration succeeds in solving a differential equation 
of the specified type whenever it is possible to carry out the integration 
that arises — a task that may require you to apply any of the standard 
techniques of integration, such as the use of tables, integration by parts or 
integration by substitution (see Unit 1). For more difficult integrals, a 
computer algebra package may be used. 


Exercise 6 
Find the general solution of each of the following differential equations. 


d 
(a) OF _ pe 2e (Hint: For the integral, try integration by parts.) 


(Hint: For the integral, try the substitution $u = 1 + t^2$.)


The solution to Exercise 6(b) can be generalised to any differential
equation of the form

\[ \frac{dy}{dx} = k\,\frac{f'(x)}{f(x)} \quad (f(x) \neq 0), \]

where k is a constant. We get the general solution

\[ y = k \ln|f(x)| + C, \]

where C is an arbitrary constant.
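
As a brief check of this formula (a sketch using only the chain rule and the standard derivative of ln|u|), differentiating the claimed general solution recovers the right-hand side of the differential equation wherever f(x) ≠ 0:

```latex
\[
\frac{d}{dx}\bigl(k \ln|f(x)| + C\bigr)
  = k \cdot \frac{1}{f(x)} \cdot f'(x)
  = k\,\frac{f'(x)}{f(x)} .
\]
```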


2.2 Separation of variables 


Direct integration applies, in an immediate sense, only to the very simplest 
type of differential equation, as described by equation (16). However, all 
other analytic methods of solution for first-order equations also eventually 
boil down to performing integrations. In this subsection, we consider how 
to solve first-order differential equations of the form 


\[ \frac{dy}{dx} = g(x)\,h(y), \tag{18} \]


where the right-hand side is a product of a function of x and a function
of y. Such first-order differential equations are said to be separable.


(Note on Exercise 5(c): remember that $\dot{y}$ stands for dy/dt, where t denotes
time.)


(Note on the generalisation of Exercise 6(b): this is a simple extension of
the result from Unit 1 that

\[ \int \frac{f'(x)}{f(x)}\,dx = \ln|f(x)| + C, \]

for f(x) ≠ 0.)


An easy example

Let us start with the simple example that we solved in the Introduction,
namely

\[ \frac{dy}{dx} = -Ay, \tag{19} \]

where y(x) > 0. This equation is of the required form, with g(x) = 1 and
h(y) = −Ay. In the Introduction we obtained the solution by guesswork,
but we will now use a systematic method to find the solution.


Dividing both sides of equation (19) by y, we obtain

\[ \frac{1}{y}\frac{dy}{dx} = -A. \]

(Because y > 0, division by y causes no problems.) Integrating both sides
then gives

\[ \int \frac{1}{y}\frac{dy}{dx}\,dx = -A \int dx. \]

Applying the rule for integration by substitution to the left-hand side (see
equation (81) in Subsection 6.3 of Unit 1), we obtain

\[ \int \frac{1}{y}\,dy = -A \int dx. \]

Integrating both sides (and remembering that y > 0) then gives

\[ \ln y + C_1 = C_2 - Ax, \]

where $C_1$ and $C_2$ are constants of integration. We can rearrange this
equation, and combine the constants, to obtain

\[ \ln y = C - Ax \]
for some constant $C = C_2 - C_1$. Taking exponentials on both sides, we get
\[ y = \exp(C - Ax) = \exp(C)\exp(-Ax) = B\exp(-Ax), \]
where B = exp(C) is another arbitrary constant. This is the same as the
solution obtained (by guesswork) in the Introduction.


In this case, the arbitrary constant B is restricted to positive values
because exp(C) > 0 for all C. (This restriction is related to our initial
requirement that y is positive.) Note also that B is not simply added to
the solution in this case.


A more typical example 


Another example of a separable differential equation, as specified in
equation (18), is

\[ \frac{dy}{dx} = x(1 + y^2). \tag{20} \]
Here, the right-hand side is a product of the functions g(x) = x and
$h(y) = 1 + y^2$. You are unlikely to guess the solution in this case. Here we
develop the systematic separation of variables method for its solution,
so-called because we rewrite the equation in a form where only y appears
on one side, and only x on the other.


As a first step, we divide both sides of this equation by $1 + y^2$, to obtain

\[ \frac{1}{1 + y^2}\frac{dy}{dx} = x, \]
and then integrate both sides with respect to x, to get
\[ \int \frac{1}{1 + y^2}\frac{dy}{dx}\,dx = \int x\,dx. \tag{21} \]
(Note that $1 + y^2$ is never zero, so it is safe to divide by it.)


The rule for integration by substitution (see Section 6 of Unit 1) tells us
that the left-hand side of this equation can be rewritten as

\[ \int \frac{1}{1 + y^2}\frac{dy}{dx}\,dx = \int \frac{1}{1 + y^2}\,dy, \]

so equation (21) becomes

\[ \int \frac{1}{1 + y^2}\,dy = \int x\,dx. \]

At this point, we have achieved the desired separation: the left-hand side
depends only on y, and the right-hand side depends only on x. On
performing the two integrations (using the table of standard integrals in
the Handbook) we obtain

\[ \arctan y = \tfrac{1}{2}x^2 + C, \tag{22} \]
where C is an arbitrary constant. (Note that one arbitrary constant is
sufficient here.) Making y the subject of the equation, we obtain the solution

\[ y(x) = \tan\left(\tfrac{1}{2}x^2 + C\right). \tag{23} \]
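
A quick check by substitution (a sketch; it uses the standard results that the derivative of tan u is sec² u and that sec² u = 1 + tan² u) confirms that the function in equation (23) satisfies equation (20):

```latex
\[
\frac{dy}{dx}
  = \frac{d}{dx}\tan\!\left(\tfrac{1}{2}x^2 + C\right)
  = x \sec^2\!\left(\tfrac{1}{2}x^2 + C\right)
  = x\left(1 + \tan^2\!\left(\tfrac{1}{2}x^2 + C\right)\right)
  = x\,(1 + y^2).
\]
```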


General procedure 


The approach just demonstrated applies more widely. In principle, it
works for any differential equation of the form

\[ \frac{dy}{dx} = g(x)\,h(y). \tag{Eq. 18} \]
On dividing both sides of this equation by h(y) (for all values of y other
than those where h(y) = 0), we obtain
\[ \frac{1}{h(y)}\frac{dy}{dx} = g(x). \]

Integration with respect to x on both sides then gives

\[ \int \frac{1}{h(y)}\frac{dy}{dx}\,dx = \int g(x)\,dx, \]

and, on applying the rule for integration by substitution to the left-hand
side, this becomes

\[ \int \frac{1}{h(y)}\,dy = \int g(x)\,dx. \tag{24} \]
(This is the form that you need to remember!)


If the two integrals can be evaluated at this stage, we obtain an equation
that relates x and y, and features an arbitrary constant. This equation is
the general solution of the differential equation (for values of y other than
those where h(y) = 0). Usually, however, y is not the subject of this
equation. Such a solution is said to be an implicit general solution of
the differential equation: an example is provided by equation (22).


Usually, the final aim is to make y the subject of the equation, if possible,
that is, to manipulate the equation into the form


y = function of x.


This is called the explicit general solution of the differential equation:
an example is provided by equation (23). (It is a good idea to check, by
substitution into the original differential equation, that the function
obtained is indeed a solution.)


In either case (implicit or explicit), a particular solution can be obtained 
from the general solution by applying an initial condition. 


The method just described for solving differential equations of the
form (18) is called the method of separation of variables since, in
equation (24), we have separated the variables to either side of the
equation, with only the dependent variable appearing on the left, and only
the independent variable on the right.


So far, we have assumed that h(y) ≠ 0. However, the condition h(y) = 0
corresponds to dy/dx = g(x) h(y) = 0 for all x, and so gives extra solutions
to the differential equation with y = constant. These exceptional cases
should be included in the general solution. The method is summarised
below.


Procedure 2 Separation of variables 


This method applies to separable differential equations, which
are of the form
\[ \frac{dy}{dx} = g(x)\,h(y). \tag{Eq. 18} \]
1. Assume that h(y) ≠ 0, and divide both sides of the differential
equation by h(y). Integrate both sides with respect to x. The
rule for integration by substitution then gives

\[ \int \frac{1}{h(y)}\,dy = \int g(x)\,dx. \tag{Eq. 24} \]


2. If possible, perform the integrations, including an arbitrary 
constant. 


3. If possible, rearrange the formula found in Step 2 to give a family 
of solutions y(x), including an arbitrary constant. 


4. Find any constant values of y for which h(y) = 0. 


5. The general solution is given by the family of solutions in Step 3, 
supplemented by any constant solutions found in Step 4. 




The method of separation of variables is useful, but there may be some 
difficulties. First, it may not be possible to perform the necessary 
integrations. Second, it may not be possible to perform the necessary 
manipulations to obtain an explicit solution. 


It is sometimes necessary to be careful about the domain of the solution 
obtained, as the following example illustrates. 


Example 3

(a) Find the general solution of the differential equation

\[ \frac{dy}{dx} = -\frac{x}{3y} \quad (y > 0). \]

(b) Find the particular solution that satisfies the initial condition y(0) = 3.


Solution


(a) The equation is of the form

\[ \frac{dy}{dx} = g(x)\,h(y), \]

where the obvious choices for g and h are

\[ g(x) = -x \quad\text{and}\quad h(y) = 1/(3y). \]

(Notice that h(y) is never equal to zero.)

We now apply Procedure 2. On dividing through by h(y) = 1/(3y)
(i.e. multiplying through by 3y) and integrating with respect to x, the
differential equation becomes

\[ \int 3y\,dy = -\int x\,dx. \]

(With practice, you will be able to move directly to this stage, as shown
in Procedure 2.)

Evaluating the integrals gives

\[ \tfrac{3}{2}y^2 = -\tfrac{1}{2}x^2 + C, \]
where C is an arbitrary constant. This is an implicit form of the
general solution.


On solving for y and noting the condition y > 0 given in the question
(which determines the sign of the square root), we obtain

\[ y = \sqrt{\tfrac{1}{3}(2C - x^2)}. \]

This can be simplified slightly by writing B in place of 2C. So

\[ y = \sqrt{\tfrac{1}{3}(B - x^2)}, \]

where B is an arbitrary constant. In fact, the value of B is somewhat
restricted. This is because the condition y > 0 implies that y is real,
and this requires that $B - x^2 > 0$. Hence $B > x^2 \geq 0$, which tells us
that B must be restricted to positive values. At the same time, the
condition $B - x^2 > 0$ restricts the possible values of x to lie between
$-\sqrt{B}$ and $\sqrt{B}$.


The explicit general solution in this case is therefore

\[ y = \sqrt{\tfrac{1}{3}(B - x^2)} \quad (-\sqrt{B} < x < \sqrt{B}), \]
where B is any positive constant.

(b) The initial condition is y(0) = 3, so we substitute x = 0 and y = 3 into
the general solution above. This gives $3 = \sqrt{\tfrac{1}{3}B}$, so B = 27, and the
required particular solution is

\[ y(x) = \sqrt{\tfrac{1}{3}(27 - x^2)} \quad (-3\sqrt{3} < x < 3\sqrt{3}). \]

(You verified in Exercise 2(d) that this function is a solution of
the differential equation.)


(Note for Exercise 10 below: the differential equation there is the logistic
equation (equation (9)), which is used to model population sizes.)
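
Returning to Example 3(b), the particular solution can be cross-checked against a numerical solution of the initial-value problem dy/dx = −x/(3y), y(0) = 3. The sketch below is an illustration, not part of the separation of variables method: it uses the classical fourth-order Runge-Kutta scheme, and the step count, the sample end points (all inside −3√3 < x < 3√3) and the function names are assumptions made for the example.

```python
import math


def rhs(x, y):
    """Right-hand side of dy/dx = -x / (3y), valid for y > 0."""
    return -x / (3 * y)


def rk4(x0, y0, x_end, steps):
    """Classical fourth-order Runge-Kutta from x0 to x_end (either direction)."""
    h = (x_end - x0) / steps
    x, y = x0, y0
    for _ in range(steps):
        k1 = rhs(x, y)
        k2 = rhs(x + h / 2, y + h * k1 / 2)
        k3 = rhs(x + h / 2, y + h * k2 / 2)
        k4 = rhs(x + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        x += h
    return y


def y_formula(x):
    """Particular solution y = sqrt((27 - x^2) / 3) found by separation of variables."""
    return math.sqrt((27 - x * x) / 3)


errors = [
    abs(rk4(0.0, 3.0, x_end, 400) - y_formula(x_end))
    for x_end in (1.0, 3.0, 4.0, -4.0)
]
print(max(errors))  # small: the formula and the numerical solution agree
```

The end points are kept away from x = ±3√3, where y approaches zero and the right-hand side of the differential equation is no longer defined.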


Here are some exercises on applying the method of separation of variables. 
Remember the following points: 


• The method initially assumes that h(y) ≠ 0 and then gives a family of
solutions containing an arbitrary constant.


• The condition h(y) = 0 may give extra solutions y = constant. These
should be included in the general solution; they may or may not have
the same form as the general family of solutions obtained for h(y) ≠ 0.


When doing these exercises, try to obtain the general solution, including 
all cases where h(y) = 0. 


Exercise 7 


Find the general solution of each of the following differential equations. 


dy yl dy dy 
a a ee AD ae 


Exercise 8 
Solve the initial-value problem 


dv utu 


da = ; v(0) = 0. 


Exercise 9 
Find the general solution of each of the following differential equations. 


(a) w=au  (b) ¢=14+2? 


Exercise 10 

(a) Solve the initial-value problem

\[ \frac{dP}{dt} = kP\left(1 - \frac{P}{M}\right) \quad\text{with}\quad P(0) = 2M, \]

where P(t) > 0, and k and M are positive constants.

You may use the fact that

\[ \int \frac{1}{x(1 - ax)}\,dx = -\ln\left|\frac{1}{x} - a\right| + C \quad\text{for } x \neq 1/a, \]

where C is an arbitrary constant of integration.


(b) Describe what happens to the solution P(t) as t becomes large and
positive.


3 Solving linear differential equations 


This section presents one final method for finding analytic solutions of 
first-order differential equations. The method, called the integrating factor 
method, applies only to a particular form of equation known as a linear 
differential equation. The definition and some properties of this type of 
equation are introduced in Subsection 3.1. Subsection 3.2 shows how to 
solve a particularly simple type of linear differential equation before we 
move on to the general method of solution in Subsection 3.3. 


3.1 Linear differential equations 


This subsection introduces the concept of linearity as applied to 
differential equations. Here the concept is introduced in the context of 
first-order differential equations, but you should be aware that the idea 
generalises to higher-order differential equations and is important from a 
theoretical point of view. 


Definitions 


• A first-order differential equation for y = y(x) is linear if it can
be expressed in the form

\[ \frac{dy}{dx} + g(x)\,y = h(x), \tag{25} \]

where g(x) and h(x) are given functions.


• A linear first-order differential equation is said to be
homogeneous if h(x) = 0 for all x, and inhomogeneous or
non-homogeneous otherwise.


Note that linearity refers to the dependent variable y. Terms like y² or
y (dy/dx) are excluded, but any function of the independent variable x is
allowed. So, for example, the differential equation

\[ \frac{dy}{dx} - x^2 y = x^3 \]


Linear second-order differential equations are considered in Unit 3.

This differential equation is of the general form dy/dx = f(x, y) used elsewhere in this unit, with f(x, y) = −g(x) y + h(x).

In equation (25), g(x) is the coefficient of y.

Equations of this type are met in the theory of electrical circuits.


is linear, with g(x) = −x² and h(x) = x³, whereas the equation
dy 2 
dc 


is not, due to the presence of the non-linear term y².


It can be shown that any initial-value problem based on equation (25) has
a unique solution provided that g(x) and h(x) are continuous functions.


Exercise 11 


Decide whether or not each of the following first-order differential 
equations is linear. 
dy 3 5 dy . 
(a) —+a2°y=2 (b) + =asing (c) 
dx da 


dz 
-* — —3,71/2 
dt ' 


d 
@g+y=t @es+y=yP © (+a) 2 4 dey = 32" 


3.2 Linear constant-coefficient equations 


The simplest example of a linear equation occurs when the function g(x)
does not depend on x, that is, where the coefficient of y is constant. In the
case of a homogeneous equation, this is just the simple example that we
dealt with in the Introduction: if g(x) = A, then the homogeneous
equation is

\[ \frac{dy}{dx} + Ay = 0, \]

which has solution $y = y_0 \exp(-Ax)$, as you saw in Section 2.


Now consider the case of the inhomogeneous linear differential equation
with constant coefficients, which is of the form

\[ \frac{dy}{dx} + Ay = h(x). \tag{26} \]

We will now solve this equation using a trick that is not obvious at first
sight. We begin by multiplying both sides by $e^{Ax}$:

\[ e^{Ax}\frac{dy}{dx} + Ae^{Ax}\,y = e^{Ax}\,h(x). \tag{27} \]

Now, using the product rule for differentiation, we notice that

\[ \frac{d}{dx}\left(e^{Ax}y\right) = e^{Ax}\frac{dy}{dx} + Ae^{Ax}\,y. \]

This means that the left-hand side of equation (27) can be written as
$\frac{d}{dx}(e^{Ax}y)$, so we have

\[ \frac{d}{dx}\left(e^{Ax}y\right) = e^{Ax}\,h(x). \tag{28} \]


This differential equation can now be solved by integrating both sides with 
respect to x and rearranging the result. 




It is easier to see how this works by considering particular examples, but
for the record, the general method proceeds as follows. Integration gives

\[ \int \frac{d}{dx}\left(e^{Ax}y\right)dx = \int e^{Ax}\,h(x)\,dx, \]

which becomes

\[ e^{Ax}y = \int e^{Ax}\,h(x)\,dx, \]

so

\[ y(x) = e^{-Ax}\left(\int e^{Ax}\,h(x)\,dx\right). \tag{29} \]

This shows that if we can perform the integral on the right-hand side, then
we can solve our original differential equation (26). The integral will
generate an arbitrary constant, and our solution will be the general
solution of the differential equation.


Example 4 


Solve the linear differential equation

\[ \frac{dy}{dx} + 2y = x. \]

Solution

The differential equation has the same form as equation (26), with A = 2
and h(x) = x, so we must multiply by the factor $e^{2x}$. Multiplying both
sides of the differential equation by $e^{2x}$, we get

\[ e^{2x}\frac{dy}{dx} + e^{2x}\,2y = e^{2x}x. \]

Following equation (28), the differential equation can then be expressed in
the form

\[ \frac{d}{dx}\left(e^{2x}y\right) = e^{2x}x. \]

Integrating both sides gives

\[ e^{2x}y = \int x e^{2x}\,dx. \]

The integral on the right-hand side can be evaluated by parts, using the
formula

\[ \int f(x)\,g'(x)\,dx = f(x)\,g(x) - \int f'(x)\,g(x)\,dx. \]

Choosing f(x) = x and $g'(x) = e^{2x}$, we have f′(x) = 1 and $g(x) = \tfrac{1}{2}e^{2x}$, so

\[ e^{2x}y = \tfrac{1}{2}xe^{2x} - \tfrac{1}{2}\int e^{2x}\,dx
= \tfrac{1}{2}xe^{2x} - \tfrac{1}{2}\left(\tfrac{1}{2}e^{2x} + C\right)
= \tfrac{1}{2}e^{2x}\left(x - \tfrac{1}{2}\right) - \tfrac{1}{2}C. \]


We can omit the arbitrary 
constant of integration here 
because the remaining indefinite 
integral will still generate one. 


There is no difficulty in dividing through by $e^{Ax}$ because this is
never equal to zero.




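Thus

\[ y(x) = \tfrac{1}{2}\left(x - \tfrac{1}{2} - Ce^{-2x}\right), \]

where C is an arbitrary constant.

We can check this answer by differentiation:

\[ \frac{dy}{dx} + 2y = \tfrac{1}{2}\left(1 + 2Ce^{-2x}\right) + \left(x - \tfrac{1}{2} - Ce^{-2x}\right) = x, \]

as required.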
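The check by differentiation can also be carried out numerically. The following is a minimal sketch, not part of the original text: the function names `y` and `residual`, the sample values of C and x, and the finite-difference step `eps` are all illustrative assumptions. It estimates dy/dx by a central difference and confirms that dy/dx + 2y − x is close to zero.

```python
import math

def y(x, C):
    """General solution from Example 4: y = (x - 1/2 - C*exp(-2x)) / 2."""
    return 0.5 * (x - 0.5 - C * math.exp(-2.0 * x))

def residual(x, C, eps=1e-5):
    """Value of dy/dx + 2y - x, with dy/dx estimated by a central difference."""
    dydx = (y(x + eps, C) - y(x - eps, C)) / (2.0 * eps)
    return dydx + 2.0 * y(x, C) - x

# The residual should be close to zero for any constant C and any x
# (the values below are arbitrary choices made for this illustration).
for C in (-3.0, 0.0, 1.0, 2.5):
    for x in (-1.0, 0.0, 0.7, 2.0):
        assert abs(residual(x, C)) < 1e-6
```

A small residual for several unrelated values of C supports the claim that the formula is a general solution, although a numerical check of this kind is evidence rather than proof.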


Exercise 12 
Solve the differential equation 


dy 20 
Peas ae 


with the initial condition y(0) = 0. Check your solution by differentiation. 


You have seen how to solve linear differential equations with constant 
coefficients. This method is easily generalised to the case where the 
coefficient is not constant. The general form of the technique is called the 
integrating factor method. 


3.3 The integrating factor method


The secret to solving the linear constant-coefficient differential equation

\[ \frac{dy}{dx} + Ay = h(x) \]

is to multiply both sides by $e^{Ax}$ and to notice that

\[ \frac{d}{dx}\left(e^{Ax}y\right) = e^{Ax}\frac{dy}{dx} + e^{Ax}Ay, \]

which is $e^{Ax}$ times the left-hand side of the differential equation.

We will now try something very similar for the general linear differential
equation

\[ \frac{dy}{dx} + g(x)\,y = h(x). \tag{Eq. 25} \]

In this case, we multiply both sides by $e^{G(x)}$, where the function G(x)
remains to be decided. This gives

\[ e^{G(x)}\frac{dy}{dx} + e^{G(x)}g(x)\,y = e^{G(x)}h(x). \tag{30} \]

We then notice that the product and chain rules of differentiation give

\[ \frac{d}{dx}\left(e^{G(x)}y\right) = e^{G(x)}\frac{dy}{dx} + e^{G(x)}\frac{dG}{dx}\,y. \]

This can be made equal to $e^{G(x)}$ times the left-hand side of the differential
equation (25) provided that we take

\[ \frac{dG}{dx} = g(x), \tag{31} \]

and this implies that

\[ G(x) = \int g(x)\,dx. \]

With this choice of G(x), equation (30) can be written as

\[ \frac{d}{dx}\left(e^{G(x)}y\right) = e^{G(x)}h(x), \]

and the solution of the differential equation is then obtained by integrating
both sides and rearranging the result.

You will soon see how this works in explicit examples. For the moment,
the key point is that we need to multiply both sides of the equation by the
factor

\[ p(x) = e^{G(x)} = \exp\left(\int g(x)\,dx\right). \tag{32} \]


This factor is called the integrating factor of the differential equation 
because it allows the left-hand side of the equation to be integrated 
exactly. This leads to the following procedure. 


Procedure 3. The integrating factor method 


This method applies to any first-order linear differential equation,
that is, any equation of the form

\[ \frac{dy}{dx} + g(x)\,y = h(x). \tag{Eq. 25} \]

1. Determine the integrating factor

\[ p(x) = \exp\left(\int g(x)\,dx\right). \tag{Eq. 32} \]

2. Multiply equation (25) by p(x) to recast the differential equation
as

\[ p(x)\frac{dy}{dx} + p(x)\,g(x)\,y = p(x)\,h(x). \]

3. Rewrite the differential equation as

\[ \frac{d}{dx}\big(p(x)\,y\big) = p(x)\,h(x). \tag{33} \]

4. Integrate both sides of this equation, including an arbitrary
constant, and rearrange the result to make y(x) the subject of
the equation. The result is the general solution of the differential
equation.
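When the integrals in Procedure 3 cannot be done by hand, the same steps can be followed numerically. The sketch below is an illustration under stated assumptions rather than part of the original procedure: the helper names `simpson` and `solve_linear` are hypothetical, the integrals are approximated by Simpson's rule, and an initial condition y(x0) = y0 is assumed so that definite integrals from x0 fix the arbitrary constant. With p(x0) = 1, Step 4 gives p(x) y(x) = y0 + ∫ p h dx.

```python
import math

def simpson(func, a, b, n=200):
    """Composite Simpson's rule for the integral of func over [a, b]; n must be even."""
    if a == b:
        return 0.0
    step = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * step)
    return total * step / 3.0

def solve_linear(g, h, x0, y0, x):
    """Approximate y(x) for dy/dx + g(x) y = h(x) with y(x0) = y0."""
    # Step 1: integrating factor p(t) = exp(integral of g from x0 to t), so p(x0) = 1.
    def p(t):
        return math.exp(simpson(g, x0, t))
    # Steps 2-4: integrating d/dx (p y) = p h from x0 to x gives p(x) y(x) - y0.
    integral = simpson(lambda t: p(t) * h(t), x0, x)
    return (y0 + integral) / p(x)

# Constant-coefficient case dy/dx + 2y = x with the assumed condition y(0) = 0.
approx = solve_linear(lambda x: 2.0, lambda x: x, 0.0, 0.0, 1.0)
exact = 0.25 + 0.25 * math.exp(-2.0)   # from y = (x - 1/2 - C e^{-2x})/2 with C = -1/2
print(approx, exact)
```

Recomputing p(t) inside the outer integral is wasteful but keeps the sketch close to the four steps of the procedure.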


We can omit the arbitrary
constant when evaluating this
integral. This is because we
need only one function G(x)
that satisfies equation (31). We
do not need the general function
to do so.


The constant of integration is 
not needed here as it can be 
shown to cancel out in Step 3. 


It is a good idea to check that 
the derivative on the left-hand 
side reproduces the left-hand 

side of the preceding equation. 




Let us first apply this procedure to the constant coefficient equation (26), 
to check that it gives the correct solution (29). 


Example 5 


Use the integrating factor method in Procedure 3 to solve

\[ \frac{dy}{dx} + Ay = h(x), \]

where A is a constant.

Solution

Comparing the equation in the question with equation (25), we see that
g(x) = A. So the integrating factor, equation (32), is

\[ p(x) = \exp\left(\int A\,dx\right) = \exp(Ax). \]

This is exactly the integrating factor that we used in Subsection 3.2.
Equation (33) then gives

\[ \frac{d}{dx}\left(e^{Ax}y\right) = e^{Ax}\,h(x), \]

which is the same as equation (28). The remainder of the procedure
follows the path that led to equation (29), so we get the solution

\[ y(x) = e^{-Ax}\left(\int e^{Ax}\,h(x)\,dx\right). \]


As with the separation of variables method, it may not be possible to 
perform the necessary final integration. However, the remainder of this 
subsection gives examples and exercises for which the integrals can be 
done. It is important to note that the constant of integration must be 
included in the final integration, as this is what makes the solution a 
general one. 


Example 6 


Use the integrating factor method to find the general solution of the 
differential equation 


\[ \frac{dy}{dx} = x - \frac{2xy}{x^2 + 1}. \]

Solution

On rearranging the differential equation as

\[ \frac{dy}{dx} + \frac{2xy}{x^2 + 1} = x, \]

we see that it has the form of a linear differential equation
dy/dx + g(x) y = h(x), with

\[ g(x) = \frac{2x}{x^2 + 1} \quad \text{and} \quad h(x) = x. \]

The integrating factor (from equation (32)) is therefore

\[ p(x) = \exp\left(\int \frac{2x}{x^2 + 1}\,dx\right). \]

Notice that $2x/(x^2 + 1)$ is of the form f′/f, so

\[ \int \frac{2x}{x^2 + 1}\,dx = \ln(x^2 + 1) \quad \text{since } x^2 + 1 > 0. \]

Remember that we do not need to include an arbitrary constant when finding an integrating factor.

The integrating factor is therefore

\[ p(x) = \exp\left(\ln(x^2 + 1)\right) = x^2 + 1. \]

Multiplying both sides of the differential equation by this factor yields

\[ (x^2 + 1)\frac{dy}{dx} + 2xy = x(x^2 + 1), \]

and the differential equation becomes

\[ \frac{d}{dx}\left((x^2 + 1)\,y\right) = x(x^2 + 1). \]

Integrating both sides gives

\[ (x^2 + 1)\,y = \int x(x^2 + 1)\,dx = \int (x^3 + x)\,dx = \tfrac{1}{4}x^4 + \tfrac{1}{2}x^2 + C, \]

where C is an arbitrary constant. Finally, to obtain an explicit solution,
we divide by x² + 1 to get

\[ y(x) = \frac{x^4 + 2x^2 + B}{4(x^2 + 1)}, \]

where B = 4C is an arbitrary constant.
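As in Example 4, this answer can be checked by differentiation. The following short verification is an added illustration that uses only the solution just obtained: multiplying it by x² + 1 and differentiating should reproduce the right-hand side of the recast equation.

```latex
\[
\frac{d}{dx}\Big((x^2 + 1)\,y\Big)
  = \frac{d}{dx}\left(\frac{x^4 + 2x^2 + B}{4}\right)
  = x^3 + x
  = x(x^2 + 1).
\]
```

The constant B disappears on differentiation, which is why the check succeeds for every value of B.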


The next example contains two differential equations that were solved in 
Exercise 7 using the method of separation of variables. Here, we solve 
them using the integrating factor method. You can compare the answers 
with those obtained earlier. 


Example 7 


Use the integrating factor method to find the general solution of each of 
the following differential equations. 


(a) \[ \frac{dy}{dx} = \frac{y - 1}{x} \quad (x > 0) \]

(b) \[ \frac{dy}{dx} = \frac{2y}{1 + x^2} \]

Solution

(a) On rearranging the differential equation as

\[ \frac{dy}{dx} - \frac{1}{x}\,y = -\frac{1}{x}, \tag{34} \]

we see that it is of the linear form dy/dx + g(x) y = h(x) with

\[ g(x) = h(x) = -\frac{1}{x}. \]

The integrating factor (from equation (32)) is therefore

\[ p(x) = \exp\left(\int \left(-\frac{1}{x}\right) dx\right) = \exp(-\ln x) \quad (\text{since } x > 0) \]
\[ \phantom{p(x)} = \exp\left(\ln x^{-1}\right) = x^{-1} = \frac{1}{x}. \]

Recall that $a \ln x = \ln(x^a)$.

Multiplying both sides of equation (34) by this factor gives

\[ \frac{1}{x}\frac{dy}{dx} - \frac{1}{x^2}\,y = -\frac{1}{x^2}, \tag{35} \]

so the differential equation becomes

\[ \frac{d}{dx}\left(\frac{1}{x}\,y\right) = -\frac{1}{x^2}. \tag{36} \]

It is always worth a quick check at this stage: is the left-hand side of equation (36) equal to the left-hand side of equation (35)?

Integration then gives

\[ \frac{y}{x} = \frac{1}{x} + C, \]

where C is an arbitrary constant. The general solution is therefore

\[ y = 1 + Cx. \]

(b) In order to put the given differential equation into the linear form
dy/dx + g(x) y = h(x), we need to bring the term in y to the left-hand
side to obtain

\[ \frac{dy}{dx} - \frac{2}{1 + x^2}\,y = 0. \tag{37} \]

Hence in this case we have g(x) = −2/(1 + x²) and h(x) = 0. (The equation is homogeneous.)

To find the integrating factor, we must evaluate the integral

\[ \int g(x)\,dx = \int \left(-\frac{2}{1 + x^2}\right) dx = -2\arctan x, \]

so the integrating factor is

\[ \exp\left(\int g(x)\,dx\right) = \exp(-2\arctan x) = e^{-2\arctan x}. \]

Multiplying through by this factor gives

\[ e^{-2\arctan x}\frac{dy}{dx} - e^{-2\arctan x}\,\frac{2y}{1 + x^2} = 0. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dx}\left(e^{-2\arctan x}\,y\right) = 0. \]

It follows, on integrating, that

\[ e^{-2\arctan x}\,y = C, \]

or equivalently,

\[ y(x) = Ce^{2\arctan x}, \]

where C is an arbitrary constant. This is the general solution of the
differential equation.


Exercise 13 


Find the general solution of each of the following differential equations. 


dy ee dy _ 
(a) 7 -y=e sin x (b) Ge=yte 
Exercise 14 


Use the integrating factor method to solve each of the following 
initial-value problems. 


(a) u′ = xu, u(0) = 2.
(b) ty +2y=t?, y(1)=landt>0. 


Exercise 15 

Solve each of the following initial-value problems. 
(a) yt+y=tt+l1, y(1)=0. 

(b) e*y=1—e%y, y(0) =3. 


Exercise 16 


Find the general solution of each of the following differential equations. 


(a) oY sy =s (x > 0) 
(ey dpsed 
a t 40 = 8008 


(Hint: If a and b are non-zero constants, then

\[ \int e^{at}\cos bt\,dt = \frac{e^{at}}{a^2 + b^2}\,(a\cos bt + b\sin bt) + C, \]

where C is an arbitrary constant.)


You can check that the
derivative on the left gives the
left-hand side of the previous
equation (using the fact that
d(arctan x)/dx = 1/(1 + x²)).


You saw the differential equation 
in part (a) in Exercise 9(a), 
where you solved it using 
separation of variables. 




You have now used a number of methods for solving first-order differential 
equations: direct integration, separation of variables and the integrating 
factor. Confronted with a fresh equation, the first issue is which method to 
try. The following exercise closes this discussion of analytic solutions by 
giving you practice at choosing an appropriate method. 


Exercise 17 


Which method(s) could you use to try to solve each of the following linear 
first-order differential equations? (You need not actually solve the 
equations. ) 


dy 3. 5 dy, dv = 
(a) iat” y=u (b) Ig 7 esine (c) mm 1 OU = 9 
d d 
(4) (1+2?)* +20y=1+0?  (e:) = y?(1 +2”) 


4 Direction fields 


The material in this section is non-assessable and will not be tested in 
continuous assessment or in the exam. However, you are advised to 
read this section and attempt the exercise, as this will provide 
valuable insights into the behaviour of solutions of differential 
equations. 


Many of the differential equations that arise in physics and applied 
mathematics cannot be solved exactly. For this reason it is valuable to 
know about methods for obtaining approximate or qualitative information 
about solutions. It is also important to be aware of how computers may be 
used to give numerical solutions of differential equations, and this will be 
considered in Section 5. 


In this section we show how qualitative information about the solutions of 
a first-order differential equation can be gleaned directly from the equation 
itself, without undertaking any form of integration process. The main 
concept here is the direction field, sketches of which usually give a good 
idea of how the graphs of solutions behave. 


We start by considering what can be deduced about solutions of any
differential equation of the form

\[ \frac{dy}{dx} = f(x, y) \tag{Eq. 10} \]

from direct observation of the equation.




In Subsection 1.1 we encountered the logistic equation

\[ \frac{dP}{dt} = kP\left(1 - \frac{P}{M}\right), \tag{Eq. 9} \]

where k and M are positive constants (so here f(t, P) = kP(1 − P/M)). In certain circumstances this is a
useful mathematical model of population sizes, in which P(t) denotes the
size of the population at time t. The constant functions P(t) = 0 and
P(t) = M are particular solutions of the differential equation. This is
because they both give dP/dt = 0 on the left-hand side and zero on the
right-hand side, so the differential equation is satisfied in the form 0 = 0.
Within the population model, these solutions correspond to a complete
absence of the population (P = 0), and an equilibrium population level
(P = M) for which the birth rate exactly balances the death rate.


Spotting constant functions that are particular solutions of the differential
equation is occasionally useful, but is of limited applicability. In general,
more useful information can be deduced by noting that any particular
solution y(x) can be plotted as a graph of y against x in the xy-plane. At
a given point $(x_0, y_0)$, the gradient (or slope) of this graph is given by the
value of dy/dx at $(x_0, y_0)$, and according to the differential equation (10),
this is given by $f(x_0, y_0)$. We therefore conclude that f(x, y) represents the
slope at (x, y) of a solution curve that passes through (x, y).


For example, if

\[ \frac{dy}{dx} = f(x, y) = x + y, \tag{38} \]

then the slope of a solution curve that passes through the point (1, 2) is
f(1, 2) = 1 + 2 = 3. This slope is positive, so the graph is increasing from
left to right through the point (1, 2). The slope of a (different) solution
curve that passes through (2, −7) is f(2, −7) = 2 − 7 = −5. This slope is
negative, so the corresponding graph is decreasing from left to right
through the point (2, −7). At the point (3, −3), the slope is
f(3, −3) = 3 − 3 = 0, so the solution curve that passes through this point
is horizontal.


When f(x, y) is viewed in this way, it is referred to as a direction field,
since it describes a direction (or slope) for each point (x, y) where f(x, y) is
defined.


Definition 


A direction field associates a unique direction to each point within 
a specified region of the xy-plane. The direction associated with the 
point (x,y) can be indicated by drawing a short line segment through 
the point, with the appropriate slope. This can be regarded as a 
segment of the tangent line at (x,y) of the solution curve that passes 
through (x, y). 




In particular, the direction field for the differential equation

\[ \frac{dy}{dx} = f(x, y) \]

associates the direction f(x, y) with the point (x, y). This is the slope
at (x, y) of the graph of a particular solution y(x) that passes through
the point (x, y).


Direction fields can be visualised by constructing the short line segments
at a finite set of points in an appropriate region of the plane, where
typically the points are chosen to form a rectangular grid. An example is
shown in Figure 10, which corresponds to the differential equation (38). In
this case the chosen region is the set of points (x, y) such that −2 < x < 2
and 0 < y < 2, and the rectangular grid consists of the points at intervals
of 0.2 in both the x- and y-directions within this region.
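The construction just described is easy to sketch in code. The example below is illustrative only: the names `direction_field` and `glyph`, the coarser grid spacing of 0.5 (with the end points of the region included), and the slope thresholds used to pick a text symbol are all assumptions made for this sketch, not choices taken from Figure 10.

```python
def f(x, y):
    """Right-hand side of equation (38): dy/dx = x + y."""
    return x + y

def direction_field(f, xs, ys):
    """Return a dictionary mapping each grid point (x, y) to the slope f(x, y)."""
    return {(x, y): f(x, y) for x in xs for y in ys}

def glyph(slope, flat=0.25, steep=2.0):
    """Crude text symbol for a short line segment with the given slope."""
    if abs(slope) < flat:
        return "-"
    if abs(slope) > steep:
        return "|"
    return "/" if slope > 0 else "\\"

# A coarse grid (spacing 0.5 rather than 0.2) keeps the printout small.
xs = [i * 0.5 for i in range(-4, 5)]   # -2.0, -1.5, ..., 2.0
ys = [j * 0.5 for j in range(0, 5)]    # 0.0, 0.5, ..., 2.0
field = direction_field(f, xs, ys)

for y in reversed(ys):                 # print the largest y at the top
    print(" ".join(glyph(field[(x, y)]) for x in xs))
```

Each printed symbol stands for one line segment, so the output is a very rough text version of a direction field diagram; a plotting library would normally be used to draw the segments with their true slopes.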


Figure 10 Part of the direction field for equation (38)


From this diagram, we can gain a good qualitative impression of how the 
graphs of particular solutions of equation (38) behave. The aim is to 
sketch curves on the diagram in such a way that the tangents to the curves 
are always parallel to the local slopes of the direction field. For example, 
starting from the point (—1,0.5) (that is, taking the initial condition to be 
y(—1) = 0.5), we expect the solution graph initially to fall as we move to 
the right. The magnitude of the negative slope decreases, however, and 
eventually reaches zero, after which the slope becomes positive and then 
increases. On this basis, we could sketch the graph of the corresponding 
particular solution and obtain something like the curve shown in Figure 11. 


Figure 11 Part of the direction field for equation (38), and the particular
solution satisfying y(−1) = 0.5 that passes through the point (−1, 0.5)


Exercise 18 


(a) Part of the direction field for the logistic equation

\[ \frac{dP}{dt} = P\left(1 - \frac{P}{1000}\right) \]

is sketched in the figure below. (This is equation (9) with k = 1 and M = 1000.)

[Figure: part of the direction field for this logistic equation]




Using this diagram, sketch the solution curves that pass through the 
following points: 


(0, 1500), (0,1000), (0,100), (0,0), (0,—100). 
From your results, describe the graphs of particular solutions of the 
differential equation. 


(b) What does your answer to part (a) tell you about the predicted 
behaviour of a population whose size P(t) at time t is modelled by this 
logistic equation? 


When analysing problems involving differential equations it is sometimes
qualitative information that is most important. For example, if we model
the population size P(t) of a species, it may be interesting to know
whether the species dies out (does P(t) → 0 as t → ∞?) rather than
having an accurate expression for P at a given value of t. Often the
direction field approach gives direct access to insights that would be hard
to extract from exact solutions. It can also be applied to differential
equations for which no exact solutions can be found.


5 Numerical solutions and Euler’s method


The material in this section is non-assessable and will not be tested in 
continuous assessment or in the exam. If you are running short of 
time, you can skip this section. 




In many cases, an exact solution of a differential equation cannot be found, 
and the most useful approach is to use a computer to find a numerical 
solution. The study of numerical methods for the solution of differential 
equations is a large area of knowledge. This section describes Euler’s 
method, which is a simple numerical method for first-order differential 
equations. Other methods are often used by experts, but most of these are 
just refinements of Euler’s method. 


Let us suppose that we wish to find a solution to the first-order differential
equation

\[ \frac{dy}{dx} = f(x, y) \]

with initial condition $y(x_0) = y_0$ (i.e. we seek a solution that passes
through the point $(x_0, y_0)$). We will consider the solution for $x > x_0$.


We will approximate the solution of this initial-value problem by moving in
a sequence of straight-line steps. Corresponding to the given initial
condition $y(x_0) = y_0$, there is a point $P_0$ in the xy-plane with coordinates
$(x_0, y_0)$, and this is our starting point. At $P_0$, the function y(x) has a
particular slope, namely $f(x_0, y_0)$. We move off from $P_0$ along a straight
line that has this slope, and continue until we have travelled a small
horizontal distance h to the right of $P_0$. The point that has now been
reached is labelled $P_1$, as in Figure 12 (where the distance h is exaggerated
for clarity).


Figure 12 Graphical representation of the first step in Euler’s method for
numerically solving y′ = f(x, y)


We denote the coordinates of $P_1$ by $(x_1, Y_1)$. For comparison, the exact
solution of the initial-value problem passes through a point $(x_1, y_1)$, where
$y_1 = y(x_1)$ is the value of the exact solution at $x = x_1$. Note that we
cannot claim that $Y_1 = y_1$. This is unlikely to happen unless the exact
solution function follows a straight line as x moves from $x_0$ to $x_1$. However,
the hope is that because we headed off from $x_0$ along the correct slope,
$Y_1$ will be reasonably close to the exact value, $y_1$. Furthermore, we expect
that this approximation becomes better as the length h is decreased.




The next thing that we need to do is obtain formulas for $x_1$ and $Y_1$ in
terms of the known quantities $x_0$, $y_0$, h and $f(x_0, y_0)$. Because the point
$P_1$ is reached from $P_0$ by taking a step to the right of horizontal length h,
we have

\[ x_1 = x_0 + h. \]

We can also express $Y_1$ in terms of other quantities by equating two
expressions for the slope of the line segment $P_0P_1$ (see Figure 12):

\[ \frac{Y_1 - y_0}{h} = f(x_0, y_0), \]

which can be rearranged to give

\[ Y_1 = y_0 + h\,f(x_0, y_0). \tag{40} \]

This completes the first stage of the method.


We now take a second step. This second step takes us from $P_1$ through a
further horizontal distance h to the right, to the point labelled $P_2$ in
Figure 13.

Figure 13 Graphical representation of the first two steps in Euler’s
method for numerically solving y′ = f(x, y)


The coordinates of $P_2$ are denoted by $(x_2, Y_2)$, and $Y_2$ gives an
approximation to the exact solution value $y_2$ at the point $x = x_2$.
Following the same logic as for the first step, we write

\[ x_2 = x_1 + h \]

and

\[ Y_2 = Y_1 + h\,f(x_1, Y_1), \]

where $f(x_1, Y_1)$ is the slope dy/dx at the point $P_1 = (x_1, Y_1)$.


Having carried out two steps of the process, you can see that the same 
procedure can be applied to construct any number of further steps. 




Suppose that after i steps we have reached the point $P_i$, with coordinates
$(x_i, Y_i)$. For the (i + 1)th step, we move away from $P_i$ along the line with
slope $f(x_i, Y_i)$ (as defined by the direction field at $P_i$). After moving
through a horizontal distance h to the right, we reach the point $P_{i+1}$ whose
coordinates are denoted by $(x_{i+1}, Y_{i+1})$, as illustrated in Figure 14.

Figure 14 Graphical representation of the (i + 1)th step of Euler’s
method for numerically solving y′ = f(x, y)


The point $P_{i+1}$ provides an approximation $Y_{i+1}$ to the exact solution value
$y_{i+1} = y(x_{i+1})$ at $x = x_{i+1}$. Arguing as before, we have

\[ x_{i+1} = x_i + h \tag{41} \]

and

\[ Y_{i+1} = Y_i + h\,f(x_i, Y_i). \tag{42} \]

To sum up, we have a procedure for constructing a sequence of points
$P_i$ with coordinates $(x_i, Y_i)$ for i = 0, 1, 2, ...,
where the values of $x_i$ and $Y_i$ for each value of i are determined by
equations (41) and (42). The starting point for the sequence is the point
$P_0$ with coordinates $(x_0, Y_0)$, where $Y_0 = y_0$. The horizontal distance h by
which we move to the right at each stage of the procedure is called the
step length or step size.


The sequence of points $P_0 = (x_0, y_0)$, $P_1 = (x_1, Y_1)$, $P_2 = (x_2, Y_2)$, ...
provides an approximate solution to the initial-value problem based on the
differential equation dy/dx = f(x, y) and the initial condition $y(x_0) = y_0$.
In other words, when the independent variable has value $x_i$, the exact
solution $y(x_i)$ is approximated by $Y_i$. The method just used to generate
this sequence is called Euler’s method.




Procedure 4 Euler’s method 


To apply Euler’s method to the initial-value problem

\[ y' = f(x, y), \quad y(x_0) = y_0, \]

proceed as follows.

1. Take $x_0$ and $Y_0 = y_0$ as starting values, choose a step length h,
and set i = 0.

2. Calculate the x-coordinate $x_{i+1}$ using the formula

\[ x_{i+1} = x_i + h. \tag{Eq. 41} \]

3. Calculate a corresponding approximation $Y_{i+1}$ to $y(x_{i+1})$, using
the formula

\[ Y_{i+1} = Y_i + h\,f(x_i, Y_i). \tag{Eq. 42} \]

4. If further approximate values are required, increase i by 1 and
return to Step 2.
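Procedure 4 translates almost line by line into code. The following is a minimal sketch under stated assumptions: the function name `euler` and its argument names are hypothetical, and the illustration uses equation (38), dy/dx = x + y, with an initial condition y(0) = 1 and step length h = 0.1 chosen only for this example.

```python
def euler(f, x0, y0, h, n_steps):
    """Apply Euler's method; return the lists [x_0, ..., x_n] and [Y_0, ..., Y_n]."""
    xs, ys = [x0], [y0]                    # Step 1: starting values, i = 0
    for i in range(n_steps):
        x_i, y_i = xs[-1], ys[-1]
        ys.append(y_i + h * f(x_i, y_i))   # Step 3: equation (42)
        # Step 2: equation (41), written as x_0 + (i + 1)h so that
        # rounding errors do not build up from repeated addition.
        xs.append(x0 + (i + 1) * h)
    return xs, ys

# Illustration with equation (38) and the assumed initial condition y(0) = 1.
xs, ys = euler(lambda x, y: x + y, x0=0.0, y0=1.0, h=0.1, n_steps=3)
for x, y in zip(xs, ys):
    print(f"x = {x:.1f}   Y = {y:.4f}")
```

Working the same three steps by hand gives Y₁ = 1 + 0.1 × 1 = 1.1, Y₂ = 1.1 + 0.1 × 1.2 = 1.22 and Y₃ = 1.22 + 0.1 × 1.42 = 1.362, which is a useful check on any implementation.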


How well does Euler’s method work? Figure 15 shows the constructed
sequence of points and, for comparison, shows a graph representing the
exact solution of the initial-value problem. This makes it clear that the
successive points $P_1, P_2, P_3, \ldots$ are only approximations to points on the
solution curve. In fact, the situation shown in Figure 15 is typical of the
behaviour of the constructed approximations, in that they gradually move
further and further from the exact solution curve. This is because at each
step, the direction of movement is along the slope at $P_i = (x_i, Y_i)$ and not
along the slope at the position reached by the exact solution $(x_i, y_i)$, where
$y_i = y(x_i)$.

Figure 15 Graphical representation of the numerical and exact solutions
of a differential equation




Improvements in accuracy can usually be achieved by reducing the step 
length h. This is illustrated in Figure 16. Of course, the improvement 
comes at the cost of having to use more steps, and therefore using more 
computer time. 
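The effect of the step length can be seen in a small experiment. The sketch below is an illustration, not a reproduction of Figure 16: the differential equation behind that figure is not stated here, so the code assumes equation (38), dy/dx = x + y, with y(0) = 1, whose exact solution y = 2eˣ − x − 1 can be confirmed by substitution. The function name `euler_value` is hypothetical.

```python
import math

def euler_value(f, x0, y0, h, n_steps):
    """Return the Euler approximation Y_n after n_steps steps of length h."""
    x, y = x0, y0
    for i in range(n_steps):
        y = y + h * f(x, y)          # equation (42)
        x = x0 + (i + 1) * h         # equation (41)
    return y

def f(x, y):
    """Right-hand side of equation (38)."""
    return x + y

def exact(x):
    """Exact solution of equation (38) with the assumed condition y(0) = 1."""
    return 2.0 * math.exp(x) - x - 1.0

# Estimate y(0.4) with step lengths h = 0.4, 0.2 and 0.1.
errors = {}
for n_steps in (1, 2, 4):
    h = 0.4 / n_steps
    errors[h] = abs(euler_value(f, 0.0, 1.0, h, n_steps) - exact(0.4))
    print(f"h = {h:.1f}   error = {errors[h]:.4f}")
```

For this example the error at x = 0.4 falls from about 0.18 to about 0.10 and then to about 0.055 as h is halved twice, so each halving of h roughly halves the error while doubling the number of steps.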

Figure 16 Comparison of the numerical and exact solutions of a
differential equation for various step lengths h (estimates with h = 0.4,
h = 0.2 and h = 0.1, and the exact solution at x = 0.4)




Figure 17(b) shows a rack from the Japanese K supercomputer 
(2009), which can complete in one second computations that would 
have taken the original Cray-1 four years. 


Figure 17 (a) The Cray-1 supercomputer, introduced in 1978; 
(b) a rack from the Japanese K supercomputer, introduced in 2009 


Learning outcomes 


After studying this unit, you should be able to do the following. 




Understand and use the basic terminology relating to differential 
equations and their solutions. 


Check by substitution whether a given function is a solution of a given 
first-order differential equation or initial-value problem. 


Find from the general solution of a first-order differential equation the 
particular solution that satisfies a given initial condition. 


Recognise when a first-order differential equation is soluble by direct 
integration, and carry out that integration when appropriate. 


Recognise when a first-order differential equation is separable, and 
apply the method of separation of variables. 


Recognise when a first-order differential equation is linear, and solve 
such an equation by the integrating factor method. 


Solutions to exercises 


Solution to Exercise 1 


We have r(P) = k(1 − P/M), so we simply need to solve the following pair
of simultaneous equations:


10 000 


From the second equation, since k > 0, we see immediately that 
M = 10000. Substituting in the first equation leads to 


\[ \frac{999}{1000}\,k = 1, \quad \text{so} \quad k = \frac{1000}{999}. \]


Solution to Exercise 2 


(a) In each case, differences in notation notwithstanding, the differential
equation has the form

\[ \frac{dy}{dx} = f(x, y), \]

and we need to show that a given function y = y(x) satisfies this
equation. We can do this by substituting y = y(x) into both sides of
the equation and showing that the results are identical. Alternatively,
if the right-hand side is simple, we can substitute y = y(x) into the
left-hand side and then rearrange the result to show that it is equal to
the right-hand side.

In this case, to show that $y = 2e^x - (x^2 + 2x + 2)$ satisfies

\[ \frac{dy}{dx} = f(x, y) = y + x^2, \]

we first differentiate y to get

\[ \frac{dy}{dx} = 2e^x - 2x - 2. \]

Then, substituting the expression for y into f(x, y), we get

\[ f(x, y) = y + x^2 = 2e^x - (x^2 + 2x + 2) + x^2 = 2e^x - 2x - 2, \]

as required.




(b) To show that y = $x? + 3 satisfies 


dy 
dx" 
we simply differentiate y to get 
dy 
dx 


which establishes the required result. 
(c) To show that $u = 2e^{x^2/2}$ satisfies

\[ u' = f(x, u) = xu, \]

we differentiate u to get

\[ \frac{du}{dx} = 2xe^{x^2/2}, \]

and then note that the right-hand side of the differential equation is

\[ f(x, u) = xu = 2xe^{x^2/2}, \]

as required.


(d) To show that 


ee viet a7 = 2° ae 
Ci 3 -. 3 ’ 


with (-3V3 < x < 3V3), satisfies 


we first differentiate y to get 
dy 97 — 22\ 1? 
= 5 ; 
Then, substituting the expression for y into f(x,y), we get 
ren x a (27 —-a2\—1? 
ZL, = a Oe a. ’ 
o 3y.« «38\ 3 
as required. 
(e) To show that $y = t + e^{-t}$ satisfies

\[ \dot{y} = f(t, y) = -y + t + 1, \]

we first differentiate y to get

\[ \frac{dy}{dt} = 1 - e^{-t}. \]

Then, substituting the expression for y into f(t, y), we get

\[ f(t, y) = -y + t + 1 = -(t + e^{-t}) + t + 1 = 1 - e^{-t}, \]

as required.




(f) To show that y=t+Ce~ satisfies 
Y= Jeg) =—ytr+ 1, 
we first differentiate y to get 


dy 
p= — =1=—Ce* 
Yat 
Then, substituting the expression for y into f(t, y), we get 


f(t,y) =—-yt+t+1=—-(t+Ce)+t+1=1-Ce™, 


as required. 
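The substitution checks in this exercise can also be spot-checked numerically. The sketch below is an illustration only: the helper name `satisfies_ode`, the sample points, the step size and the tolerance are all assumptions made here, not part of the module text. It estimates the derivative by a central difference and compares it with f evaluated on the proposed solution.

```python
import math

def satisfies_ode(y, f, xs, h=1e-5, tol=1e-6):
    """Return True if dy/dx is close to f(x, y(x)) at every sample point in xs."""
    for x in xs:
        # Central-difference estimate of the derivative of y at x.
        dydx = (y(x + h) - y(x - h)) / (2 * h)
        if abs(dydx - f(x, y(x))) > tol * max(1.0, abs(dydx)):
            return False
    return True

# Exercise 2(a): y = 2e^x - (x^2 + 2x + 2) and dy/dx = y + x^2.
y_a = lambda x: 2 * math.exp(x) - (x**2 + 2 * x + 2)
f_a = lambda x, y: y + x**2

# Exercise 2(f): y = t + C e^{-t} and dy/dt = -y + t + 1.
# C = 3.0 is an arbitrary illustrative choice.
C = 3.0
y_f = lambda t: t + C * math.exp(-t)
f_f = lambda t, y: -y + t + 1

samples = [-1.0, 0.0, 0.5, 2.0]  # arbitrary sample points
print(satisfies_ode(y_a, f_a, samples))
print(satisfies_ode(y_f, f_f, samples))
```

A numerical check of this kind only samples a few points, so it supports the algebraic verification rather than replacing it.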


Solution to Exercise 3 

(a) To verify that $y = C - \tfrac13 e^{-3x}$ satisfies

\[ \frac{dy}{dx} = e^{-3x}, \]

we differentiate y to get

\[ \frac{dy}{dx} = -\tfrac13\left(-3e^{-3x}\right) = e^{-3x}, \]


as required. 
(b) To verify that $u = Ce^t - t - 1$ satisfies

\[ \dot{u} = f(t, u) = t + u, \]

we differentiate u to get

\[ \dot{u} = \frac{du}{dt} = Ce^t - 1. \]

Then, substituting the expression for u into f(t, u), we get

\[ f(t, u) = t + u = Ce^t - 1, \]


as required. 
(c) To verify that $P(t) = CMe^{kt}/(1 + Ce^{kt})$ satisfies

\[ \frac{dP}{dt} = f(t, P) = kP\left(1 - \frac{P}{M}\right), \]

we differentiate P(t) using the quotient rule. This gives

\[ \frac{dP}{dt} = \frac{(CMke^{kt})(1 + Ce^{kt}) - (CMe^{kt})(Cke^{kt})}{(1 + Ce^{kt})^2} = \frac{CMke^{kt}}{(1 + Ce^{kt})^2}. \]

Then, substituting the expression for P into f(t, P), we get

\[ f(t, P) = k\,\frac{CMe^{kt}}{1 + Ce^{kt}}\left(1 - \frac{Ce^{kt}}{1 + Ce^{kt}}\right) = \frac{CMke^{kt}}{(1 + Ce^{kt})^2}, \]


as required. 




Solution to Exercise 4 


(a) From Exercise 3(c) we know that

\[ P(t) = \frac{CMe^{kt}}{1 + Ce^{kt}} = \frac{10Ce^{0.15t}}{1 + Ce^{0.15t}} \]

is a solution of the differential equation. Because $e^0 = 1$, the initial
condition P(0) = 1 then implies that

\[ 1 = \frac{10C}{1 + C}, \quad\text{so}\quad C = \tfrac19. \]

The particular solution consistent with the initial condition is therefore

\[ P(t) = \frac{\tfrac{10}{9}e^{0.15t}}{1 + \tfrac19 e^{0.15t}} = \frac{10e^{0.15t}}{9 + e^{0.15t}}. \]

Dividing top and bottom by $e^{0.15t}$, we see that

\[ P(t) = \frac{10}{9e^{-0.15t} + 1}. \]


For large values of t, the exponential term in the denominator will be 
very small. The result is that P will approach the value 10 in the long 
term. As P is the population size measured in hundreds of thousands, 
the population is predicted to approach one million in the long term. 
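The long-term behaviour described above can be illustrated by evaluating the particular solution at a few times. This is a small sketch only; the function name `P` mirrors the notation of the solution, and the sample times are arbitrary choices made here.

```python
import math

def P(t):
    """Particular solution from Exercise 4: P(t) = 10 / (9 e^{-0.15 t} + 1)."""
    return 10.0 / (9.0 * math.exp(-0.15 * t) + 1.0)

# P is measured in hundreds of thousands, so P = 10 means one million.
for t in (0, 10, 50, 100):
    print(t, round(P(t), 4))
```

The printed values start at P(0) = 1 and increase towards 10, in line with the argument about the exponential term in the denominator.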


Solution to Exercise 5 


We apply direct integration to find the general solution. In each case,
C is an arbitrary constant.

(a) The differential equation dy/dx = 6x has general solution

\[ y = \int 6x \, dx = 3x^2 + C. \]

From the initial condition y(1) = 5, we have 5 = 3 + C, so C = 2. The
solution to the initial-value problem is therefore

\[ y = 3x^2 + 2. \]

(b) The differential equation $dv/du = e^{4u}$ has general solution

\[ v = \int e^{4u} \, du = \tfrac14 e^{4u} + C. \]

From the initial condition v(0) = 2, we have $2 = \tfrac14 + C$, so $C = \tfrac74$.
The solution to the initial-value problem is therefore

\[ v = \tfrac14\left(e^{4u} + 7\right). \]

(c) The differential equation $\dot{y} = 5\sin 2t$ has general solution

\[ y = \int 5\sin 2t \, dt = -\tfrac52\cos 2t + C. \]

From the initial condition y(0) = 0, we have $0 = -\tfrac52 + C$, so $C = \tfrac52$.
The solution to the initial-value problem is therefore

\[ y = \tfrac52(1 - \cos 2t). \]
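Direct integration with an initial condition amounts to writing y(x) = y(x0) + the integral of the right-hand side from x0 to x. The sketch below illustrates this numerically with composite Simpson's rule, which is a stand-in chosen here for the exact integration used in the solutions; the function names are hypothetical.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        # Odd interior points get weight 4, even interior points weight 2.
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def solve_by_integration(f, x0, y0, x):
    """Solve dy/dx = f(x) with y(x0) = y0 by integrating f from x0 to x."""
    return y0 + simpson(f, x0, x)

# Exercise 5(a): dy/dx = 6x, y(1) = 5; compare with y = 3x^2 + 2 at x = 2.
print(solve_by_integration(lambda x: 6 * x, 1.0, 5.0, 2.0), 3 * 2.0**2 + 2)

# Exercise 5(c): dy/dt = 5 sin 2t, y(0) = 0; compare with (5/2)(1 - cos 2t) at t = 1.
print(solve_by_integration(lambda t: 5 * math.sin(2 * t), 0.0, 0.0, 1.0),
      2.5 * (1 - math.cos(2.0)))
```

Each printed pair should agree to many decimal places, which is a useful sanity check on the constants found from the initial conditions.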


Solution to Exercise 6 


(a) The differential equation $dy/dx = xe^{-2x}$ has general solution

\[ y = \int x e^{-2x} \, dx. \]

The integral can be found using integration by parts (see Unit 1).
Since differentiating x simplifies it, we take f(x) = x and $g'(x) = e^{-2x}$.
Then f'(x) = 1 and $g(x) = -\tfrac12 e^{-2x}$. So, using the formula

\[ \int f(x)\, g'(x) \, dx = f(x)\, g(x) - \int f'(x)\, g(x) \, dx, \]

we get

\[ \int x e^{-2x} \, dx = -\tfrac12 x e^{-2x} + \int \tfrac12 e^{-2x} \, dx = -\tfrac12 x e^{-2x} - \tfrac14 e^{-2x} + C, \]

where C is an arbitrary constant. The general solution of the
differential equation is therefore

\[ y = -\tfrac14(2x + 1)e^{-2x} + C. \]

(b) The differential equation $\dot{p} = t/(1 + t^2)$ has general solution

\[ p = \int \frac{t}{1 + t^2} \, dt. \]

Using the hint provided, we make the substitution $u = 1 + t^2$, for
which du/dt = 2t. Writing the required integral as

\[ \int \frac{t}{1 + t^2} \, dt = \tfrac12 \int \frac{2t}{1 + t^2} \, dt, \]

we then obtain

\[ \int \frac{t}{1 + t^2} \, dt = \tfrac12 \int \frac{1}{u}\frac{du}{dt} \, dt = \tfrac12 \int \frac{1}{u} \, du = \tfrac12 \ln u + C \quad (\text{since } u = 1 + t^2 > 0) \]
\[ = \tfrac12 \ln(1 + t^2) + C, \]

where C is an arbitrary constant. The general solution of the
differential equation is therefore

\[ p = \tfrac12 \ln(1 + t^2) + C. \]




Solution to Exercise 7 


(a) The differential equation is

\[ \frac{dy}{dx} = \frac{y - 1}{x}, \quad \text{where } x > 0, \]

which is of the form dy/dx = g(x) h(y) with g(x) = 1/x and
h(y) = y − 1. For h(y) = y − 1 ≠ 0, Procedure 2 gives

\[ \int \frac{1}{y - 1} \, dy = \int \frac{1}{x} \, dx. \]

Since x > 0 and y − 1 ≠ 0, integration produces

\[ \ln|y - 1| = \ln x + C, \]

where C is an arbitrary constant. Taking the exponential of both sides
of this equation, we get

\[ |y - 1| = e^{\ln x + C} = e^C e^{\ln x} = e^C x. \]

Hence

\[ y - 1 = \pm e^C x. \]

We therefore have a family of solutions given by

\[ y(x) = 1 + Bx, \]

where $B = \pm e^C$ is an arbitrary non-zero constant.


The special case of the constant function y(x) = 1 is also a solution 
since if we substitute this into both sides of the differential equation, 
we get zero on both sides. This special case can be incorporated into 
the main family of solutions by setting B = 0. 


So we conclude that any function of the form 
\[ y(x) = 1 + Bx \]

is a solution of the differential equation for x > 0. In fact, it is the
general solution of the equation. (In Section 3, Example 7(a), you will 
see this established by a different method.) 


(b) The differential equation is

\[ \frac{dy}{dx} = \frac{2y}{1 + x^2}, \]

which is of the form dy/dx = g(x) h(y) with $g(x) = 1/(1 + x^2)$ and
h(y) = 2y. For h(y) = 2y ≠ 0, Procedure 2 gives

\[ \int \frac{1}{y} \, dy = \int \frac{2}{1 + x^2} \, dx. \]

For y ≠ 0, integration produces

\[ \ln|y| = 2(\arctan x + C), \]


where C is an arbitrary constant. 


On solving this equation for y, we obtain

\[ y = \pm e^{2\arctan x + 2C} = \pm e^{2C} e^{2\arctan x} = Be^{2\arctan x}, \]

where $B = \pm e^{2C}$ is a non-zero but otherwise arbitrary constant.


The special case of the constant function y(x) = 0 is also a solution, 
since if we substitute this into both sides of the differential equation, 
we get zero on both sides. This special case can be incorporated into 
the main family of solutions by setting B = 0. 
So we conclude that any function of the form 

\[ y(x) = Be^{2\arctan x}, \]


where B is an arbitrary constant, is a solution of the differential 
equation. In fact, it is the general solution of the equation. (In 
Section 3, Example 7(b), you will see this established by a different 
method.) 


Solution to Exercise 8 


The differential equation is

\[ \frac{dv}{du} = e^{u+v} = e^u e^v. \]

Dividing through by $e^v$ (note that $e^v > 0$) and integrating with respect to u, we obtain

\[ \int e^{-v} \, dv = \int e^u \, du. \]

Integration produces

\[ -e^{-v} = e^u + C, \]

where C is an arbitrary constant. So

\[ e^{-v} = -e^u - C. \]

Taking the logarithm of both sides, we get

\[ -v = \ln(-e^u - C), \]

so

\[ v = -\ln(B - e^u), \]

where B = −C.

Since the argument of the ln function must be positive, we require that
$B - e^u > 0$, so $B > e^u$. Hence B must be positive. Taking the logarithm
of both sides gives $\ln B > \ln e^u$, so $\ln B > u$.

Therefore the general solution is

\[ v = -\ln(B - e^u) \quad (u < \ln B), \]

where B is an arbitrary positive constant.

The initial condition v(0) = 0 gives $0 = -\ln(B - e^0)$, so $B - e^0 = 1$ and
hence B = 2. The solution to the initial-value problem is therefore

\[ v = -\ln(2 - e^u) \quad (u < \ln 2). \]




Solution to Exercise 9 


Each of these differential equations can be solved by separation of
variables.

(a) In this case the differential equation is

\[ \frac{du}{dx} = xu. \]

For the cases where u ≠ 0, we divide through by u and integrate with
respect to x. This gives

\[ \int \frac{1}{u} \, du = \int x \, dx. \]

Integration produces

\[ \ln|u| = \tfrac12 x^2 + C, \]

where C is an arbitrary constant. On solving this equation for u, we
obtain

\[ u = \pm e^{x^2/2 + C} = \pm e^C e^{x^2/2} = Be^{x^2/2}, \]

where $B = \pm e^C$ is a non-zero but otherwise arbitrary constant.


The special case of the constant function u(x) = 0 is also a solution, 
since if we substitute this into both sides of the differential equation, 
we get zero on both sides. This special case can be incorporated into 
the main family of solutions by setting B = 0. 


So we conclude that any function of the form 

\[ u(x) = Be^{x^2/2}, \]

where B is an arbitrary constant, is a solution of the differential
equation. This is the general solution.
(You verified that $u = 2e^{x^2/2}$ is a particular solution of this differential
equation in Exercise 2(c).) 
(b) The differential equation is

\[ \dot{x} = \frac{dx}{dt} = 1 + x^2. \]

We divide through by $1 + x^2$ and integrate with respect to t. This
gives

\[ \int \frac{1}{1 + x^2} \, dx = \int 1 \, dt. \]

Integration then produces

\[ \arctan x = t + C, \]

where C is an arbitrary constant. On solving for x, we get the general
solution

\[ x(t) = \tan(t + C). \]

We must restrict the domain of x(t) to avoid values of t where tan is
undefined. That is, we must ensure that $t + C \neq (n + \tfrac12)\pi$, where n is
an integer, but the precise choice of domain will depend on the initial
conditions.


Solution to Exercise 10 


(a) The given equation is

\[ \frac{dP}{dt} = kP\left(1 - \frac{P}{M}\right). \]

First, note that the constant functions P(t) = 0 and P(t) = M are
both solutions of the differential equation, giving zero on both sides.
The function P(t) = 0 is not allowed, however, since we are told that
P(t) > 0. Ignoring for the moment the possibility that P(t) = M, we
can use the method of separation of variables to obtain

\[ \int \frac{1}{P(1 - P/M)} \, dP = \int k \, dt. \]

The integral on the left-hand side can be evaluated using the result
given in the question with a = 1/M. We get

\[ -\ln\left|\frac{1}{P} - \frac{1}{M}\right| = kt + C, \]

where C is an arbitrary constant. Hence

\[ \left|\frac{1}{P} - \frac{1}{M}\right| = e^{-kt - C} = e^{-C} e^{-kt}; \]

thus

\[ \frac{1}{P} = \frac{1}{M} \pm e^{-C} e^{-kt} = \frac{1}{M} + Be^{-kt}, \]

where B is a non-zero but otherwise arbitrary constant.

Now consider the constant solution P(t) = M. This can be
incorporated into the above family of solutions by taking B = 0. So
the restriction B ≠ 0 can be dropped and the general solution of the
differential equation is

\[ P(t) = \left(\frac{1}{M} + Be^{-kt}\right)^{-1}, \]

where B is an arbitrary constant.
From the initial condition P(0) = 2M, we get

\[ \frac{1}{2M} = \frac{1}{M} + Be^0, \quad\text{so}\quad B = -\frac{1}{2M}. \]

The solution to the initial-value problem is therefore

\[ P(t) = \left(\frac{1}{M} - \frac{1}{2M}e^{-kt}\right)^{-1} = \frac{2M}{2 - e^{-kt}}. \]

(b) As t → ∞ we have $e^{-kt} \to 0$, so the value of P(t) approaches M.
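The closed-form solution of this initial-value problem can be compared with a direct numerical integration of the logistic equation. The sketch below uses the classical fourth-order Runge-Kutta method, which is a technique brought in here for illustration and not one used in this unit; the values M = 1000 and k = 0.5 are arbitrary assumptions, as are the function names.

```python
import math

def rk4(f, t0, y0, t1, n=1000):
    """Integrate dy/dt = f(t, y) from t0 to t1 with n classical RK4 steps."""
    h = (t1 - t0) / n
    t, y = t0, y0
    for _ in range(n):
        k1 = f(t, y)
        k2 = f(t + h / 2, y + h * k1 / 2)
        k3 = f(t + h / 2, y + h * k2 / 2)
        k4 = f(t + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return y

M, k = 1000.0, 0.5  # illustrative values, not taken from the exercise

def logistic(t, P):
    """Right-hand side of dP/dt = k P (1 - P/M)."""
    return k * P * (1 - P / M)

def exact(t):
    """Solution of the initial-value problem with P(0) = 2M."""
    return 2 * M / (2 - math.exp(-k * t))

numeric = rk4(logistic, 0.0, 2 * M, 5.0)
print(numeric, exact(5.0))
```

The two printed values should agree closely, and increasing the end time shows both approaching M from above.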




Solution to Exercise 11 

(a) The equation dy/dz + x*y = x” is linear, with g(x) = x? and 
Ga ae 

(b) The equation dy/dx = x sin x is linear, with g(x) = 0 (for all x) and
h(x) = x sin x.

(c) The equation $dz/dt = -3z^{1/2}$ is not linear (because of the $z^{1/2}$ term).

(d) The equation $\dot{y} + y^2 = t$ is not linear (because of the $y^2$ term).

(e) The equation $x\,(dy/dx) + y = y^2$ is not linear (because of the $y^2$ term).

(f) The equation $(1 + x^2)\,(dy/dx) + 2xy = 3x^2$ is linear, since we can
divide through by $1 + x^2$ to obtain

\[ \frac{dy}{dx} + \frac{2xy}{1 + x^2} = \frac{3x^2}{1 + x^2}, \]

which is of linear form with $g(x) = 2x/(1 + x^2)$ and
$h(x) = 3x^2/(1 + x^2)$.


Solution to Exercise 12 


The differential equation

\[ \frac{dy}{dx} + y = e^{2x} \]

has the same form as equation (26), with A = 1 and $h(x) = e^{2x}$.

Multiplying both sides of the differential equation by the factor $e^{Ax} = e^x$,
we obtain

\[ e^x \frac{dy}{dx} + e^x y = e^x e^{2x} = e^{3x}, \]

and this can be written in the form

\[ \frac{d}{dx}\left(e^x y\right) = e^{3x}. \]

Integrating both sides then gives

\[ e^x y = \tfrac13 e^{3x} + C, \]

so

\[ y(x) = \tfrac13 e^{2x} + Ce^{-x}. \]

The initial condition y(0) = 0 gives $0 = \tfrac13 + C$, so $C = -\tfrac13$. The particular
solution consistent with the given initial condition is therefore

\[ y(x) = \tfrac13\left(e^{2x} - e^{-x}\right). \]

We check this by differentiating y(x) to get

\[ \frac{dy}{dx} = \tfrac13\left(2e^{2x} + e^{-x}\right), \]

so

\[ \frac{dy}{dx} + y = \tfrac13\left(2e^{2x} + e^{-x}\right) + \tfrac13\left(e^{2x} - e^{-x}\right) = e^{2x}, \]


as required. 
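The integrating factor calculation above can be mirrored numerically. For an equation of the form dy/dx + Ay = h(x) with y(0) = y0, multiplying by e^{Ax} and integrating from 0 to x gives e^{Ax} y(x) = y0 + the integral of e^{As} h(s) over [0, x]. The sketch below evaluates that integral with Simpson's rule (an approximation chosen here for illustration; the function names are hypothetical) and compares the result with the particular solution found in this exercise.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def solve_linear(A, h, y0, x):
    """Solve dy/dx + A y = h(x), y(0) = y0, using the integrating factor e^{A x}."""
    # d/dx (e^{A x} y) = e^{A x} h(x), so e^{A x} y(x) = y0 + integral of e^{A s} h(s).
    integral = simpson(lambda s: math.exp(A * s) * h(s), 0.0, x)
    return math.exp(-A * x) * (y0 + integral)

# Exercise 12: A = 1, h(x) = e^{2x}, y(0) = 0.
x = 1.0
numeric = solve_linear(1.0, lambda s: math.exp(2 * s), 0.0, x)
closed_form = (math.exp(2 * x) - math.exp(-x)) / 3
print(numeric, closed_form)
```

The agreement between the two values is a check on both the integrating factor and the constant C obtained from the initial condition.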


Solution to Exercise 13 


(a) The given equation is

\[ \frac{dy}{dx} - y = e^x \sin x. \]

Comparison with equations (25) and (32) shows that the integrating
factor is

\[ p(x) = \exp\left(\int (-1) \, dx\right) = e^{-x}. \]

Multiplying through by p(x) then gives

\[ e^{-x}\frac{dy}{dx} - e^{-x} y = \sin x. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dx}\left(e^{-x} y\right) = \sin x. \]

(You should check that the derivative on the left-hand side is equal to
the left-hand side of the preceding equation.)

On integrating, we find the general solution

\[ e^{-x} y = -\cos x + C, \]

or equivalently,

\[ y(x) = e^x (C - \cos x), \]


where C is an arbitrary constant. 


(b) The given equation, when rearranged into the form of equation (25), is

\[ \frac{dy}{dx} - y = x. \]

This has the same left-hand side as the differential equation in
part (a), and hence the same integrating factor, $p(x) = e^{-x}$.
Multiplying through by p(x) gives

\[ e^{-x}\frac{dy}{dx} - e^{-x} y = x e^{-x}. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dx}\left(e^{-x} y\right) = x e^{-x}. \]

Integrating both sides, we get

\[ e^{-x} y = \int x e^{-x} \, dx. \]

The integral on the right-hand side is integrated by parts, using the
formula

\[ \int f(x)\, g'(x) \, dx = f(x)\, g(x) - \int f'(x)\, g(x) \, dx, \]

with f(x) = x and $g'(x) = e^{-x}$.
We have f'(x) = 1 and $g(x) = -e^{-x}$, so

\[ e^{-x} y = -x e^{-x} + \int e^{-x} \, dx = -x e^{-x} - e^{-x} + C = C - (x + 1)e^{-x}, \]

where C is an arbitrary constant. Multiplying through by $e^x$, the
explicit form of the general solution is

\[ y = Ce^x - (x + 1). \]

Solution to Exercise 14 


(a) The given equation, when rearranged into the form of equation (25), is

\[ \frac{du}{dx} - xu = 0. \]

The integrating factor is

\[ p(x) = \exp\left(\int (-x) \, dx\right) = \exp(-x^2/2) = e^{-x^2/2}. \]

Multiplying through by p(x) gives

\[ e^{-x^2/2}\frac{du}{dx} - x e^{-x^2/2} u = 0. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dx}\left(e^{-x^2/2} u\right) = 0. \]

(You should check that the derivative on the left-hand side is equal to
the left-hand side of the preceding equation.)

On integrating, we find the general solution

\[ e^{-x^2/2} u = C, \]

or equivalently,

\[ u(x) = Ce^{x^2/2}, \]

where C is an arbitrary constant.

From the initial condition u(0) = 2, we have $2 = Ce^0$, so C = 2. Hence
the solution of the initial-value problem is

\[ u(x) = 2e^{x^2/2}. \]


(b) After division by t (which is allowed because we are told that t > 0),
the given equation can be written as

\[ \frac{dy}{dt} + \frac{2}{t} y = t. \]

The integrating factor is

\[ p(t) = \exp\left(\int \frac{2}{t} \, dt\right) = \exp(2\ln t) = \exp(\ln t^2) = t^2. \]

Multiplying through by p(t) gives

\[ t^2 \frac{dy}{dt} + 2ty = t^3. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dt}\left(t^2 y\right) = t^3. \]

(Again, it’s worth checking that the derivative on the left-hand side
gives the left-hand side of the preceding equation.)

On integrating, we find the general solution

\[ t^2 y = \tfrac14 t^4 + C, \]

or equivalently,

\[ y(t) = \tfrac14 t^2 + Ct^{-2}, \]

where C is an arbitrary constant.

From the initial condition y(1) = 1, we have $1 = \tfrac14 + C$, so $C = \tfrac34$.
Hence the solution of the initial-value problem is

\[ y(t) = \tfrac14\left(t^2 + 3t^{-2}\right). \]

Solution to Exercise 15 


(a) The given equation is

\[ \frac{dy}{dt} + y = t + 1. \]

So the integrating factor is

\[ p(t) = \exp\left(\int 1 \, dt\right) = \exp(t) = e^t. \]

Multiplying through by p(t) gives

\[ e^t \frac{dy}{dt} + e^t y = (t + 1)e^t. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dt}\left(e^t y\right) = (t + 1)e^t. \]

So

\[ e^t y = \int (t + 1)e^t \, dt. \]

The integral on the right-hand side is evaluated by parts, using
f(t) = t + 1 and $g'(t) = e^t$. We have f'(t) = 1 and $g(t) = e^t$, so

\[ e^t y = (t + 1)e^t - \int e^t \, dt = (t + 1)e^t - e^t + C = te^t + C, \]

where C is an arbitrary constant.

Multiplying through by $e^{-t}$, the general solution in explicit form is

\[ y(t) = Ce^{-t} + t. \]

From the initial condition y(1) = 0, we have $0 = Ce^{-1} + 1$, so C = −e.
Hence the solution of the initial-value problem is

\[ y(t) = t - e^{1-t}. \]

(b) After division by $e^{3t}$ and rearrangement, the given equation becomes
$dy/dt + y = e^{-3t}$. This has the same left-hand side as the differential
equation in part (a), and hence the same integrating factor, $p(t) = e^t$.
Multiplying through by p(t) gives

\[ e^t \frac{dy}{dt} + e^t y = e^{-2t}. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dt}\left(e^t y\right) = e^{-2t}. \]

On integrating, we find the general solution

\[ e^t y = -\tfrac12 e^{-2t} + C, \]

or equivalently,

\[ y(t) = Ce^{-t} - \tfrac12 e^{-3t}, \]

where C is an arbitrary constant.

From the initial condition y(0) = 3, we have $3 = Ce^0 - \tfrac12 e^0$, so $C = \tfrac72$.
Hence the solution of the initial-value problem is

\[ y(t) = \tfrac12\left(7e^{-t} - e^{-3t}\right). \]


Solution to Exercise 16 


(a) After division by x (where x > 0), the given equation becomes
dy/dx − (3/x)y = 1. The integrating factor is therefore

\[ p(x) = \exp\left(\int \left(-\frac{3}{x}\right) dx\right) = \exp(-3\ln x) \quad (\text{since } x > 0) \]
\[ = \exp\left(\ln(x^{-3})\right) = x^{-3}. \]

Multiplying through by p(x) gives

\[ x^{-3}\frac{dy}{dx} - 3x^{-4} y = x^{-3}. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dx}\left(x^{-3} y\right) = x^{-3}. \]

On integrating, we find the general solution

\[ x^{-3} y = -\tfrac12 x^{-2} + C, \]

or equivalently,

\[ y(x) = Cx^3 - \tfrac12 x, \]

where C is an arbitrary constant.


(b) The given equation is dv/dt + 4v = 3 cos 2t. The integrating factor is

\[ p(t) = \exp\left(\int 4 \, dt\right) = \exp(4t) = e^{4t}. \]

Multiplying through by p(t) gives

\[ e^{4t}\frac{dv}{dt} + 4e^{4t} v = 3e^{4t}\cos 2t. \]

Thus the differential equation can be rewritten as

\[ \frac{d}{dt}\left(e^{4t} v\right) = 3e^{4t}\cos 2t. \]

On integrating (using the hint for the right-hand side, with a = 4 and
b = 2), we find

\[ e^{4t} v = \tfrac{3}{20} e^{4t}(4\cos 2t + 2\sin 2t) + C, \]

where C is an arbitrary constant. Multiplying through by $e^{-4t}$, the
general solution in explicit form is

\[ v(t) = \tfrac{3}{10}(2\cos 2t + \sin 2t) + Ce^{-4t}. \]
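As a check on part (b), the general solution can be substituted back into the differential equation dv/dt + 4v = 3 cos 2t. The working below is a sketch of that verification; it uses only the solution and equation stated in this part.

```latex
\begin{align*}
v(t) &= \tfrac{3}{10}(2\cos 2t + \sin 2t) + Ce^{-4t},\\
\frac{dv}{dt} &= \tfrac{3}{10}(-4\sin 2t + 2\cos 2t) - 4Ce^{-4t},\\
\frac{dv}{dt} + 4v &= \tfrac{3}{10}(-4\sin 2t + 2\cos 2t + 8\cos 2t + 4\sin 2t) + (-4 + 4)Ce^{-4t}\\
&= \tfrac{3}{10}(10\cos 2t) = 3\cos 2t.
\end{align*}
```

The sine terms and the exponential terms cancel, leaving the required right-hand side for every value of C.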


Solution to Exercise 17 
(a) and (d) require the integrating factor method. 
(b) is best solved by direct integration. 


(c) can be solved by separation of variables or the integrating factor 
method. 


(e) requires separation of variables. 


Solution to Exercise 18 


(a) The diagram shows that the slope is zero at all points on the 
horizontal lines P = 0 and P = 1000, so these correspond to constant 
solutions of the differential equation. (As pointed out earlier in the 
text, these two solutions can also be spotted directly from the form of 
the differential equation.) 


The graphs of solutions through a starting point above the line 
P = 1000 appear to decrease, but at a slower and slower rate, tending 
from above towards the asymptote P = 1000 as t increases. 


The graphs of solutions through starting points in the region 

0 < P < 1000 are increasing, with slope growing before the level 
P = 500 is reached and declining thereafter. For large values of t, 
these graphs tend from below towards the asymptote P = 1000. 


For a starting point in the region P < 0, the graphs decrease without 
limit and with steeper and steeper slope. 




These various cases are illustrated in the figure below. 


[Figure: direction field with typical solution graphs, plotted as P against t.]


(b) If the differential equation is considered as a model of population 
behaviour, then the region P < 0 must be excluded. The above 
analysis leads to the following predictions for the population. 

• If the population size is zero at the start, then it remains zero.

• If the population size is 1000 at the start, then it remains fixed at
this level.

• If the population starts at a level higher than 1000, then it
declines (more and more gradually) towards 1000.

• If the population starts at a level below 1000 (but above 0), then
it increases (more and more gradually) towards 1000.

• Solutions with P < 0 are unphysical in the context of population
models.
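The qualitative predictions listed above can be reproduced with a simple numerical experiment. The sketch below assumes, as a labelled guess, that the differential equation has the logistic form dP/dt = kP(1 − P/M) with M = 1000; the rate k = 0.5, the step counts and the function name `euler_path` are all illustrative assumptions, and the forward Euler method is used only because it is short to write down.

```python
def euler_path(P0, k=0.5, M=1000.0, t_end=40.0, n=4000):
    """Forward-Euler estimate of P(t_end) for dP/dt = k P (1 - P/M), P(0) = P0.

    The logistic form and the values of k and M are assumptions for illustration.
    """
    h = t_end / n
    P = P0
    for _ in range(n):
        P += h * k * P * (1 - P / M)
    return P

print(euler_path(0.0))      # constant solution P = 0
print(euler_path(1000.0))   # constant solution P = 1000
print(euler_path(1500.0))   # starts above 1000 and declines towards it
print(euler_path(200.0))    # starts between 0 and 1000 and rises towards 1000
# A short run from a negative start shows P moving away from zero.
print(euler_path(-10.0, t_end=5.0, n=500))
```

The negative start is run only for a short time because, under this assumed model, such solutions decrease more and more steeply.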




Acknowledgements 


Grateful acknowledgement is made to the following sources: 
Figure 3: Andrew Butko. 


Figure 4: Wallace 63 / 
http://commons.wikimedia.org/wiki/File:Smilodon_head.jpg. This file is
licensed under the Creative Commons Attribution-Share Alike 3.0. 


Figure 7: NASA. 
Figure 8: NASA. 


Figure 17(a): Taken from 

http://en.wikipedia.org/wiki/File:Cray_1IMG_9126.jpg. This file is
licensed under the Creative Commons Attribution-Share Alike 2.0 France 
licence. 


Figure 17(b): Taken from 

http://en.wikipedia.org/wiki/File:Keisoku-Fujitsu.jpg. This file is licensed
under the Creative Commons Attribution-Share Alike 3.0 Unported 
licence. 


Every effort has been made to contact copyright holders. If any have been 
inadvertently overlooked, the publishers will be pleased to make the 
necessary arrangements at the first opportunity. 




Unit 3 


Second-order differential equations 


Introduction 


In this unit we move from first-order differential equations to second-order 
differential equations, that is, differential equations involving a second (but 
no higher) derivative. Examples of such equations are 
2 2 

ou _ 3 +2y=4e” and sot +y=arsine. 
Second-order differential equations play a central role in the physical 
sciences. They are found, for example, in laws describing mechanical 
systems, wave motion, electric currents and quantum phenomena. 


To take a simple case, consider a particle of mass m that moves in one
dimension along the x-axis. At any given time t, the particle’s position is
x(t), and its velocity and acceleration are given by the derivatives dx/dt
and $d^2x/dt^2$. There are no general laws for the position or velocity of the
particle, but there is a very important law for its acceleration: Newton’s
second law tells us that

mass × acceleration = force,

which implies that

\[ m\frac{d^2x}{dt^2} = F, \qquad (1) \]

where F is the force acting on the particle. The force need not be 
constant, and may vary with the position x or the velocity dx/dt of the 
particle. So, depending on the precise details, we get a second-order 
differential equation for x as a function of t, and the solution of this 
equation tells us how the particle can move. 


The system known as a simple harmonic oscillator provides a good 
example. Here, a particle of mass m is suspended at the lower end of a 
spring that is attached to a fixed support (Figure 1). The particle moves 
up and down along a vertical x-axis, subject to a force F provided by the
spring and gravity. If the system is left to settle, the particle comes to rest
at a point of equilibrium, which we label x = 0. Because the particle does
not spontaneously move away from this position, we can infer that F = 0
when x = 0.

When the particle is displaced from x = 0, the force F tends to draw the
particle back towards x = 0. We consider the case where the force is
proportional to the displacement from equilibrium, and take

\[ F = -kx, \qquad (2) \]


where k is a positive constant. The negative sign in this equation ensures 
that the force always acts in a direction that tends to restore the particle 
to its equilibrium position. 


Putting equations (1) and (2) together, we get the differential equation

\[ m\frac{d^2x}{dt^2} = -kx. \]


The order of a differential 
equation is defined in Unit 2. 
A second-order differential 
equation may or may not 
include a first derivative. 


Don’t worry if you have not met 
Newton’s second law before: the 
essential aims of this unit do not 
rely on it. 




Figure 1 A particle of 
mass m moves along the 
x-axis subject to a force F 
provided by a spring and 
gravity 




Recalling that k > 0 and m > 0, we can also express this as

\[ \frac{d^2x}{dt^2} = -\omega^2 x, \qquad (3) \]

where $\omega = \sqrt{k/m}$ is a positive constant. Equation (3) is called the
equation of motion of a simple harmonic oscillator. It is a second-order
differential equation whose solution tells us how the particle can move. 


This unit develops systematic techniques to solve equations like this. For 
the moment, we will simply guess the solution and check that it works. 
You know that 


\[ \frac{d}{dt}(\sin t) = \cos t \quad\text{and}\quad \frac{d}{dt}(\cos t) = -\sin t, \]

so

\[ \frac{d^2}{dt^2}(\sin t) = -\sin t \quad\text{and}\quad \frac{d^2}{dt^2}(\cos t) = -\cos t. \]


In other words, taking the second derivative of a sine or cosine function 
gives the same function back again, but with a minus sign. This is very 
close to the behaviour needed to solve equation (3). We therefore try a 
function of the form 


\[ x(t) = C\sin(\omega t) + D\cos(\omega t), \qquad (4) \]

where C and D are any constants, and ω is the constant in equation (3).
Differentiating this function once, and then again, we get

\[ \frac{dx}{dt} = C\omega\cos(\omega t) - D\omega\sin(\omega t), \]

\[ \frac{d^2x}{dt^2} = -C\omega^2\sin(\omega t) - D\omega^2\cos(\omega t) = -\omega^2 x. \]

So the function in equation (4) does indeed satisfy equation (3). In fact, it
is the general solution of this differential equation.
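The claim that a combination of sin(ωt) and cos(ωt) satisfies equation (3) can be spot-checked numerically. The sketch below estimates the second derivative with a central second difference and compares it with −ω²x(t). The values C = 2, D = 3 and ω = 4, the step size and the sample times are illustrative choices made here, and the function names are hypothetical.

```python
import math

def x(t, C=2.0, D=3.0, omega=4.0):
    """Trial solution x(t) = C sin(omega t) + D cos(omega t) from equation (4)."""
    return C * math.sin(omega * t) + D * math.cos(omega * t)

def second_derivative(g, t, h=1e-4):
    """Central second-difference estimate of g''(t)."""
    return (g(t + h) - 2 * g(t) + g(t - h)) / h**2

omega = 4.0
for t in (0.0, 0.3, 1.0, 2.5):
    lhs = second_derivative(x, t)      # numerical estimate of d^2x/dt^2
    rhs = -omega**2 * x(t)             # right-hand side of equation (3)
    print(f"t = {t}: x'' ~ {lhs:.4f}, -omega^2 x = {rhs:.4f}")
```

The two columns agree to within the accuracy of the finite-difference estimate, whatever values are chosen for C and D.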


Notice that our solution involves two constants, C and D, whose values are 
not specified. These constants have arbitrary values, and they are called 
arbitrary constants. It is typical for the general solution of a second-order
differential equation to have two arbitrary constants. The values of these
constants depend on how the system is released, and you will see how they
are determined later in this unit. To take a definite case, suppose that
C = 2, D = 3 and ω = 4, when measured in suitable units. Then we have
the particular solution 


x(t) = 2sin(4t) + 3cos(4t), (5) 


and this is plotted in Figure 2. The wiggles in this graph correspond to the 
oscillations performed by a system like the particle on the end of the 
spring in Figure 1. 


Figure 2. A graph of equation (5), which is a particular solution of 
equation (3) 


The simple harmonic motion described by equation (4) continues forever, 
but we know from everyday experience that oscillations generally die away 
after a while. We can get a more realistic description by including an 
additional force in our model — one that will dampen the oscillations down. 
We take the additional force to be −γ dx/dt, where γ is a positive
constant, leading to the differential equation

\[ m\frac{d^2x}{dt^2} = -kx - \gamma\frac{dx}{dt}. \qquad (6) \]


This is the equation of motion of a damped harmonic oscillator. Later in 
this unit you will see that (in appropriate circumstances) it has solutions 
that oscillate but diminish and eventually die away. 
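The decaying oscillations can be previewed numerically before the analytical methods are developed. The sketch below steps equation (6) forward in time with the semi-implicit Euler method, a simple scheme chosen here for illustration. All parameter values (m = 1, k = 16, damping constant 0.5, initial displacement 1 and initial velocity 0) and the function name `simulate` are assumptions, not values from the text.

```python
def simulate(m=1.0, k=16.0, gamma=0.5, x0=1.0, v0=0.0, t_end=20.0, n=20000):
    """Integrate m x'' = -k x - gamma x' and return the list of positions."""
    h = t_end / n
    x, v = x0, v0
    xs = [x]
    for _ in range(n):
        a = (-k * x - gamma * v) / m   # acceleration from equation (6)
        v += h * a                     # update the velocity first,
        x += h * v                     # then the position with the new velocity
        xs.append(x)
    return xs

xs = simulate()
half = len(xs) // 2
early_peak = max(abs(p) for p in xs[:half])   # largest displacement for t in [0, 10]
late_peak = max(abs(p) for p in xs[half:])    # largest displacement for t in [10, 20]
print(early_peak, late_peak)
```

With these assumed values the later peak is much smaller than the earlier one, while setting `gamma=0.0` gives peaks of essentially equal size, as expected for undamped motion.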


We can also add in another force, f(t), which is applied to the particle by 
some external agency. We assume that this force is known directly as a 
function of time (and does not depend on the position or velocity of the
particle). Under these circumstances, Newton’s second law leads to the
differential equation

\[ m\frac{d^2x}{dt^2} = -kx - \gamma\frac{dx}{dt} + f(t), \qquad (7) \]


and this is the equation of motion of a forced damped harmonic oscillator. 
If the external force is oscillatory, the response of the system may depend 
very sensitively on the frequency of the external force. This phenomenon is 
called resonance; it will be explored at the end of this unit. 


Harmonic oscillators are everywhere 


Harmonic oscillators play a central role in physics and its 
applications. If a system performs small oscillations about an 
equilibrium point, then it is generally a good approximation to model 
it as a harmonic oscillator, including the additional terms in 
equations (6) and (7) when necessary. 




It should come as no surprise that the to-and-fro motion of a 
pendulum clock can be modelled by a harmonic oscillator. On a 
smaller scale, vibrating molecules and vibrating crystals are also 
modelled as harmonic oscillators. 


Equations similar to (7) are also used to describe the oscillations of 
currents in electrical circuits that allow radios to be tuned to selected 
stations. Moreover, each frequency in an electromagnetic field can be 
regarded as a harmonic oscillator, and this is a key insight used in 
advanced physics when electromagnetic fields are treated quantum 
mechanically. 


A more everyday example is provided by the suspension system of a 
mountain bike (or any other vehicle). The rider is protected from the 
vibrations caused by a rough track by a rugged suspension system, 
such as that shown in Figure 3, and this can also be modelled by 
equation (7). 


Figure 3. The suspension system of a mountain bike is designed to 
absorb shocks. The rear part of the frame can rotate about a pivot 
point, compressing or extending the spring. 


Study guide 


This unit requires no previous knowledge beyond that needed for Unit 2, 
apart from some familiarity with complex numbers. The relevant material 
on complex numbers was reviewed in Unit 1. 


The differential equations discussed in this unit all belong to one broad 
class: they are all linear constant-coefficient second-order differential 
equations. These equations play such an important role in physics and 
areas of applied mathematics that they easily deserve a unit to themselves. 




Section 1 introduces some basic principles and terminology. Sections 2 
and 3 give methods for finding the general solutions for our class of 
second-order differential equations: Section 2 covers so-called homogeneous 
equations, while Section 3 covers inhomogeneous equations. Section 4 then 
explains how extra information can be used to help us to select particular 
solutions that are appropriate in given situations. 


For the most part, the unit presents the topic of second-order differential 
equations in purely mathematical terms, and its learning aims can be fully 
met without reference to physical laws. However, it is illuminating to see 
physical interpretations of the mathematical equations, so Subsection 2.4 
and Section 5 revisit the oscillators considered in the Introduction. 


Although the differential equations discussed in this unit and the last may 
seem to belong to a limited range of classes, in practice they encompass 
most of what a mathematical scientist needs to know about differential 
equations. The results obtained in this unit will be used again in Units 6 
and 12. 


1 Some preliminary remarks 


As for first-order differential equations, second-order differential equations 
can be written using a variety of notations for functions and derivatives. 


If t is the independent variable and y is the dependent variable, we can 
regard y as a function of t and write y = f(t). More usually, however, we 
write y = y(t), using the same symbol y for both the variable and the 
function. (The merits of this notation were discussed in Unit 1.) The first 
derivative of y with respect to t may be written as dy/dt, $\dot{y}$ or y′, and the
second derivative as $d^2y/dt^2$, $\ddot{y}$ or y″.


This section makes some general comments that will be important for 
understanding the methods introduced in Sections 2 and 3. 


1.1 Requirement for two arbitrary constants 


In Unit 1 you saw that when we solve a first-order differential equation, we
get a general solution containing one arbitrary constant. The value of the
arbitrary constant is undetermined in general, but it can be found by using
additional information provided by an initial condition. Here we consider
the corresponding result for second-order equations. You saw in the 
Introduction that the differential equation for a simple harmonic oscillator 
(equation (3)) has a general solution (equation (4)) that contains two 
arbitrary constants. This turns out to be the general rule. 


You can assume that the general solution of any second-order 
differential equation contains two arbitrary constants. 




Of course, the independent 
variable is not always t, and the 
dependent variable is not 
always y! 




To further illustrate this point, consider the differential equation

\[ \frac{d^2y}{dt^2} = a, \qquad (8) \]

where t is the independent variable, and a is a given constant. This
equation describes the motion of a particle with constant acceleration a.
To take a definite case, we will measure distance in metres and time in
seconds, and suppose that a = 3 in these units. Our differential equation
then becomes

\[ \frac{d^2y}{dt^2} = 3. \qquad (9) \]

An equation like this can be solved by integrating both sides twice. The
first integration gives

\[ \frac{dy}{dt} = \int 3 \, dt = 3t + C, \]

where C is an arbitrary constant. A second integration then gives

\[ y(t) = \int (3t + C) \, dt = \tfrac32 t^2 + Ct + D, \qquad (10) \]


where D is another arbitrary constant. Equation (10) is the general 
solution of the differential equation. It gives a formula that describes the 
collection of all possible solutions of the equation. As promised, this 
formula contains two arbitrary constants, C and D. 


If we take definite values for C and D, we get a particular solution of the 
differential equation. For example, if we take C = 2 and D = 3, then the 
particular solution is 


\[ y(t) = \tfrac32 t^2 + 2t + 3. \]

In the context of straight-line motion, this solution tells us where the
particle is located at each instant t. Moreover, differentiating both sides
gives

\[ \frac{dy}{dt} = 3t + 2, \]

so it also gives the velocity dy/dt of the particle at each instant t.


How do we determine the arbitrary constants? You have seen that a 
second-order differential equation has two arbitrary constants, so we need 
two pieces of information to determine them. A whole section of this unit 
(Section 4) is devoted to ways of finding the arbitrary constants, but we 
will make a brief comment now. 


One way of finding the arbitrary constants is to use initial conditions.
However, it is not enough to specify the value of the function y(t) at a
fixed time $t = t_0$. We also need extra information, and this can be provided
by giving the value of the derivative dy/dt at the same fixed time, $t = t_0$.




Example 1 


The following description of motion is based on units of metres and
seconds. At t = 0, a car has initial position y = 100 and initial velocity
u = 4. Between t = 0 and t = 10, the car travels with constant acceleration
a = 3 along a straight road.


Find the particular solution of equation (9) in this case, and use it to 
predict the car’s position and velocity at t = 10. 


Solution 
Following the above calculation, the general solution of equation (9) is

$$y(t) = \tfrac{3}{2}t^2 + Ct + D, \tag{Eq. 10}$$

where C and D are arbitrary constants. The initial conditions are
y(0) = 100 and y′(0) = 4. The first condition gives

$$100 = \tfrac{3}{2} \times 0^2 + C \times 0 + D = D.$$

Differentiating equation (10) gives y′(t) = 3t + C, so the second condition
gives

$$4 = 3 \times 0 + C = C.$$

We conclude that the arbitrary constants have values C = 4 and D = 100.
The required particular solution is therefore

$$y(t) = \tfrac{3}{2}t^2 + 4t + 100.$$

Differentiating this function gives the car’s velocity as a function of time:

$$y'(t) = 3t + 4.$$

We obtain y(10) = 290 and y′(10) = 34. So at time 10 seconds, the car’s
position is 290 metres and its velocity is 34 metres per second.

The condition y′(0) = 4 could also be written as $\dot{y}(0) = 4$ or as
$\left.\dfrac{dy}{dt}\right|_{t=0} = 4$.
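The arithmetic in Example 1 can be checked with a short Python sketch. The names `position` and `velocity` are illustrative choices, not notation used in this unit.

```python
# Particular solution from Example 1: y(t) = (3/2) t^2 + C t + D,
# with C and D fixed by the initial conditions y(0) = 100 and y'(0) = 4.
ACCELERATION = 3.0   # a, in metres per second squared
C = 4.0              # equals the initial velocity y'(0)
D = 100.0            # equals the initial position y(0)


def position(t):
    """Position y(t) in metres."""
    return 0.5 * ACCELERATION * t**2 + C * t + D


def velocity(t):
    """Velocity y'(t) in metres per second."""
    return ACCELERATION * t + C


# Position and velocity at t = 10 seconds
print(position(10.0), velocity(10.0))
```

Evaluating the two functions at t = 0 recovers the initial conditions, which is a quick way to confirm that C and D have been identified correctly.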


The solution of second-order differential equations is rarely as easy as the
solution of equation (9). In fact, the approach of repeated direct
integration works only for equations of the form

$$\frac{d^2y}{dx^2} = f(x),$$

and requires that both integrations can be carried out.


Initial conditions and Newton’s second law 


You have seen that Newton’s second law leads to second-order 
differential equations, and that the general solution of a second-order 
differential equation contains two arbitrary constants. 




Compare the definitions for 
first-order equations in Unit 2. 
The important feature is the 
linear combination of y and its 
derivatives on the left-hand side. 


If a = 0, then the equation is 
first-order. 


The term non-homogeneous is 
sometimes used instead of 
inhomogeneous. 




The need for two arbitrary constants connects to everyday experience. 
If you throw a stone upwards, and want to predict its future motion, 
it is not enough to know where the stone was released. You need to 
know both its initial position and its initial velocity. These two bits 
of information are sufficient to determine the two arbitrary constants 
in the general solution, and hence select a unique particular solution. 


A single differential equation has a multitude of particular solutions, 
distinguished by different arbitrary constants. In the context of 
mechanics, this is a great unifying feature. We do not need different 
rules to explain the various motions of falling apples, cricket balls, 
asteroids or orbiting satellites. All these motions can be modelled by 
the same differential equation (obtained from Newton’s second law 
and the law of gravitation). If the motions are different, it is because 
the initial conditions are different. In a nutshell: a differential 
equation provides unity, and initial conditions provide variety. 


1.2 Linearity and superposition 


This unit considers second-order differential equations that are linear and 
have constant coefficients. You met linear constant-coefficient equations in 
Unit 2 in the context of first-order differential equations. But what do the 
terms ‘linear’ and ‘constant-coefficient’ mean in the context of 
second-order equations? The answer lies in the following definitions. 


Definitions 


• A second-order differential equation for y = y(x) is linear if it
can be expressed in the form

$$a(x)\frac{d^2y}{dx^2} + b(x)\frac{dy}{dx} + c(x)\,y = f(x),$$

where a(x), b(x), c(x) and f(x) are given continuous functions,
and a(x) is not equal to the zero function.

• A linear second-order differential equation is said to be
constant-coefficient if the functions a(x), b(x) and c(x) are all
constants, so that the equation is of the form

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x), \tag{11}$$

where $a \neq 0$.

• A linear constant-coefficient second-order differential equation is
said to be homogeneous if f(x) = 0 for all x, and
inhomogeneous otherwise.




Linear constant-coefficient second-order differential equations can be
written in other ways. For example, we can divide equation (11) through
by a to obtain an equation of the form

$$\frac{d^2y}{dx^2} + \beta\frac{dy}{dx} + \gamma y = \phi(x),$$

where $\beta = b/a$ and $\gamma = c/a$ are constants and $\phi(x) = f(x)/a$, and this more
closely resembles the definition of linear first-order differential equations
from Unit 2.


Exercise 1 


Consider the following second-order differential equations. 


es y y 2 y y 
ag © aaa gre WO ee eee 
2 
(v) ay” $2°y=0 (v) SH bay = 3% (wi) ay 4 ty = 3 
d*t dt 
(vii) 2 +35 + At = sind (viii) @=—4t (ix) = —4e 


(a) For each equation, identify the dependent and independent variables.

(b) Which of the equations are linear?

(c) Which of the equations are linear and constant-coefficient?

(d) Which of the linear constant-coefficient equations are homogeneous?


The principle of superposition 


We now introduce a key principle that will turn out to be extremely useful 
throughout this unit. The principle is a fundamental property of all linear 
differential equations, but we discuss it here in the context of linear 
constant-coefficient second-order equations. 


Suppose that we have a solution $y_1(x)$ of the differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f_1(x),$$

and a solution $y_2(x)$ of the differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f_2(x).$$

Then the principle of superposition states that any linear combination
$k_1 y_1(x) + k_2 y_2(x)$, where $k_1$ and $k_2$ are constants, is a solution of the
differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = k_1 f_1(x) + k_2 f_2(x). \tag{12}$$

In principle, the coefficients a, b and c could be functions of x,
but our interest in this unit is confined to the case where they
are constants.




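It is not difficult to see why this is true. If we substitute $k_1 y_1 + k_2 y_2$ into
the left-hand side of equation (12), we get

$$\begin{aligned}
&a\frac{d^2}{dx^2}(k_1 y_1 + k_2 y_2) + b\frac{d}{dx}(k_1 y_1 + k_2 y_2) + c(k_1 y_1 + k_2 y_2) \\
&\quad = a\left(k_1\frac{d^2y_1}{dx^2} + k_2\frac{d^2y_2}{dx^2}\right) + b\left(k_1\frac{dy_1}{dx} + k_2\frac{dy_2}{dx}\right) + c(k_1 y_1 + k_2 y_2) \\
&\quad = k_1\left(a\frac{d^2y_1}{dx^2} + b\frac{dy_1}{dx} + c y_1\right) + k_2\left(a\frac{d^2y_2}{dx^2} + b\frac{dy_2}{dx} + c y_2\right) \\
&\quad = k_1 f_1(x) + k_2 f_2(x),
\end{aligned}$$

as required.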


This important result is summarised as follows. 


The principle of superposition 


If $y_1(x)$ is a solution of the linear second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f_1(x),$$

and $y_2(x)$ is a solution of the linear second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f_2(x)$$

(with the same left-hand side), then the function

$$y(x) = k_1 y_1(x) + k_2 y_2(x),$$

where $k_1$ and $k_2$ are any constants, is a solution of the differential
equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = k_1 f_1(x) + k_2 f_2(x).$$


The power of this principle is obvious: it enables us to find new solutions 
by adding together existing ones. 
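As a concrete illustration, the following Python sketch checks the principle for one hand-picked case. The equation y″ + y = f(x), the solutions y1(x) = x and y2(x) = 1, and the constants k1 = 2 and k2 = −3 are all assumptions chosen for this example; they do not come from the text.

```python
# Check the principle of superposition for y'' + y = f(x)  (a = 1, b = 0, c = 1).
# y1(x) = x solves y'' + y = x, and y2(x) = 1 solves y'' + y = 1.
# Each solution is stored as a triple (y, y', y'') of exact derivatives.

def lhs(solution, x, a=1.0, b=0.0, c=1.0):
    """Evaluate a*y'' + b*y' + c*y at the point x."""
    y, dy, d2y = solution
    return a * d2y(x) + b * dy(x) + c * y(x)


y1 = (lambda x: x, lambda x: 1.0, lambda x: 0.0)
y2 = (lambda x: 1.0, lambda x: 0.0, lambda x: 0.0)

k1, k2 = 2.0, -3.0

# The linear combination k1*y1 + k2*y2, together with its first two derivatives
combo = tuple(
    (lambda x, f=f, g=g: k1 * f(x) + k2 * g(x)) for f, g in zip(y1, y2)
)

for x in (0.0, 0.5, 2.0):
    expected_rhs = k1 * x + k2 * 1.0   # k1*f1(x) + k2*f2(x)
    assert abs(lhs(combo, x) - expected_rhs) < 1e-12
```

The check works only because differentiation is linear, which is exactly the step used in the derivation above.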


An important special case arises when $f_1(x) = f_2(x) = 0$. In this case we
see that if $y_1(x)$ and $y_2(x)$ are both solutions of the homogeneous equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0, \tag{13}$$

then any linear combination

$$y(x) = k_1 y_1(x) + k_2 y_2(x),$$

where $k_1$ and $k_2$ are constants, is also a solution of the same homogeneous
equation (13).




2 Homogeneous differential equations 


This section develops a method for finding the general solutions of 
homogeneous linear constant-coefficient second-order differential equations. 
Section 3 will consider the general solutions of inhomogeneous equations, 
and Section 4 will discuss how particular solutions are selected in given 
situations. 


Before the method for homogeneous equations is described in detail, it is 
helpful to look at two simple cases. 


2.1 Two simple cases 


In this section, we return to the equation of motion of a simple harmonic
oscillator. Before tackling this, however, we will solve the closely related
equation

$$\frac{d^2y}{dt^2} - \omega^2 y = 0, \tag{14}$$

where $\omega$ is a given positive constant. This differs from the equation for a
simple harmonic oscillator only in the sign of the coefficient of y, which is
negative in this case.


Our method of solution is very simple. We notice that a function $y = e^{\lambda t}$,
where $\lambda$ is any constant, can be differentiated, and then differentiated
again, to give

$$\frac{dy}{dt} = \lambda e^{\lambda t} \quad\text{and}\quad \frac{d^2y}{dt^2} = \lambda^2 e^{\lambda t}.$$

If we substitute this function into the differential equation (14), we get

$$\lambda^2 e^{\lambda t} - \omega^2 e^{\lambda t} = 0.$$




The exponential factors can be cancelled (because they are never equal to
zero) and we are left with a simple algebraic equation for $\lambda$:

$$\lambda^2 - \omega^2 = 0.$$

This equation has two solutions: $\lambda = \omega$ and $\lambda = -\omega$. So we have found two
distinct solutions of equation (14), namely

$$y = e^{\omega t} \quad\text{and}\quad y = e^{-\omega t}.$$


The argument then goes as follows: 


• The differential equation is homogeneous, so the principle of
superposition guarantees that any linear combination of the solutions
$y = e^{\omega t}$ and $y = e^{-\omega t}$ is also a solution of the differential equation.
Hence the function

$$y(t) = Ce^{\omega t} + De^{-\omega t}, \tag{15}$$

where C and D are arbitrary constants, satisfies equation (14).

• Then remember that the general solution of a second-order differential
equation contains two arbitrary constants. The function
$y(t) = Ce^{\omega t} + De^{-\omega t}$ satisfies the differential equation and contains
two arbitrary constants. This means that it is the general solution of
equation (14). The arbitrary constants could take any values, but if
we want our solution to be real-valued, they must be real.

Now we consider the equation of motion of a simple harmonic oscillator,
which can be written in the form (from equation (3))

$$\frac{d^2y}{dt^2} + \omega^2 y = 0, \tag{16}$$

where $\omega$ is a given positive constant.


The tactics for solving this equation are just the same. We try a solution
of the form $e^{\lambda t}$, where $\lambda$ is an undetermined constant. Substituting this
into the differential equation, we obtain

$$\lambda^2 e^{\lambda t} + \omega^2 e^{\lambda t} = 0.$$

Then, cancelling the (non-zero) exponential factors, we get the algebraic
equation

$$\lambda^2 + \omega^2 = 0.$$

The solutions of this equation are the complex numbers $\lambda = i\omega$ and
$\lambda = -i\omega$, and corresponding to these we have two distinct solutions of
equation (16):

$$y = e^{i\omega t} \quad\text{and}\quad y = e^{-i\omega t}.$$

Using the principle of superposition, and the fact that the general solution
of a second-order differential equation contains two arbitrary constants, we
can follow an argument like that given above to conclude that the general
solution of equation (16) is

$$y(t) = Ae^{i\omega t} + Be^{-i\omega t}, \tag{17}$$

where A and B are arbitrary constants (which may be complex numbers in
this case).


There is nothing wrong with this solution, but we can put it in a more
familiar form by using Euler’s formula

$$e^{ix} = \cos x + i\sin x.$$

Substituting $x = \omega t$ gives

$$e^{i\omega t} = \cos(\omega t) + i\sin(\omega t),$$

while substituting $x = -\omega t$ gives

$$e^{-i\omega t} = \cos(-\omega t) + i\sin(-\omega t) = \cos(\omega t) - i\sin(\omega t).$$

We therefore see that the general solution is

$$\begin{aligned}
y(t) &= Ae^{i\omega t} + Be^{-i\omega t} \\
&= A[\cos(\omega t) + i\sin(\omega t)] + B[\cos(\omega t) - i\sin(\omega t)] \\
&= (A + B)\cos(\omega t) + i(A - B)\sin(\omega t),
\end{aligned}$$

and this can be expressed as

$$y(t) = C\cos(\omega t) + D\sin(\omega t), \tag{18}$$

where C = A + B and D = i(A − B) are arbitrary constants. If y(t) is
real, then the constants C and D are real-valued. In this case, the
constants A and B are in general not real-valued, but that does not matter. The
important thing is that we have obtained the general solution of
equation (16) in the form of equation (18) and this, of course, agrees with
the solution given in the Introduction.
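The relationship between the constants in equations (17) and (18) can be checked numerically. The sketch below uses Python's standard `cmath` module; the function names and the sample values of A, B and ω are illustrative assumptions. B is taken to be the complex conjugate of A, which is one way of making C and D real.

```python
import cmath
import math


def real_form_constants(A, B):
    """Return (C, D) with C = A + B and D = i(A - B)."""
    return A + B, 1j * (A - B)


def complex_form(A, B, omega, t):
    """Equation (17): A e^{i omega t} + B e^{-i omega t}."""
    return A * cmath.exp(1j * omega * t) + B * cmath.exp(-1j * omega * t)


def real_form(C, D, omega, t):
    """Equation (18): C cos(omega t) + D sin(omega t)."""
    return C * math.cos(omega * t) + D * math.sin(omega * t)


# Illustrative values: B is the complex conjugate of A, so C and D are real.
A = 1.0 - 2.0j
B = A.conjugate()
C, D = real_form_constants(A, B)
omega = 3.0

for t in (0.0, 0.4, 1.7):
    assert abs(complex_form(A, B, omega, t) - real_form(C, D, omega, t)) < 1e-12
```

With these values C = 2 and D = 4, even though neither A nor B is real.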


2.2 Solution in the general case 


The method just described works far more generally. Suppose that we wish 
to solve the differential equation 


$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0, \tag{19}$$

where a, b and c are constants, with $a \neq 0$. This is the general form of a
homogeneous linear second-order equation with constant coefficients.

We start by substituting the trial solution

$$y = e^{\lambda x},$$

where $\lambda$ is an undetermined constant, into the differential equation. We
have $dy/dx = \lambda e^{\lambda x}$ and $d^2y/dx^2 = \lambda^2 e^{\lambda x}$, so substituting $y = e^{\lambda x}$ into the
left-hand side of equation (19) gives

$$\begin{aligned}
a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy &= a\lambda^2 e^{\lambda x} + b\lambda e^{\lambda x} + c e^{\lambda x} \\
&= (a\lambda^2 + b\lambda + c)e^{\lambda x}.
\end{aligned}$$

For variety, we use the symbol x for the independent variable.




The auxiliary equation is 
sometimes called the 
characteristic equation. 




Hence $y = e^{\lambda x}$ is a solution of equation (19) provided that $\lambda$ satisfies the
quadratic equation

$$a\lambda^2 + b\lambda + c = 0. \tag{20}$$


This equation plays such an important role in solving linear 
constant-coefficient second-order differential equations that it is given a 
special name. 


Definition 
The auxiliary equation of the homogeneous linear
constant-coefficient second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0$$

is the quadratic equation

$$a\lambda^2 + b\lambda + c = 0.$$


There is no need to write down all the steps that led to the auxiliary
equation. You can just use the rules that emerge from the calculation
leading to equation (20): the auxiliary equation is obtained from the
differential equation by

replacing $\dfrac{d^2y}{dx^2}$ by $\lambda^2$, $\dfrac{dy}{dx}$ by $\lambda$, and y by 1.


Example 2

Write down the auxiliary equation of the differential equation

$$3\frac{d^2y}{dx^2} - 2\frac{dy}{dx} + 4y = 0.$$

Solution

The auxiliary equation is

$$3\lambda^2 - 2\lambda + 4 = 0.$$


Exercise 2 


Write down the auxiliary equation of each of the following differential 
equations. 

dy dy 
(a) 


Te Pgp tO¥=9 = (b) y"-HV=0 (ce) F428 =0 




We know that $y = e^{\lambda x}$ is a solution of equation (19) provided that $\lambda$
satisfies the corresponding auxiliary equation. But the auxiliary equation
is a quadratic equation, so it has two roots, $\lambda_1$ and $\lambda_2$ say. For the
moment, we assume that these are distinct: $\lambda_1 \neq \lambda_2$. Corresponding to
these roots, there are two distinct solutions of differential equation (19):

$$y_1(x) = e^{\lambda_1 x} \quad\text{and}\quad y_2(x) = e^{\lambda_2 x}.$$

We now follow the logic of the preceding subsection. The principle of
superposition implies that any linear combination of $y_1(x)$ and $y_2(x)$
satisfies the differential equation. It therefore follows that the function

$$y(x) = C y_1(x) + D y_2(x), \tag{21}$$


where C and D are arbitrary constants, satisfies the differential equation. 
This solution contains two arbitrary constants, as expected for the general 
solution of a second-order differential equation. We therefore conclude that 
equation (21) is the general solution of equation (19). This important 
result is summarised as a theorem. 


Theorem 1 General solution of homogeneous equations 


Given a homogeneous linear second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0,$$

with constant coefficients $a \neq 0$, b and c, the auxiliary equation is

$$a\lambda^2 + b\lambda + c = 0.$$

This usually has two distinct roots, $\lambda_1$ and $\lambda_2$, associated with two
distinct solutions $y_1(x) = e^{\lambda_1 x}$ and $y_2(x) = e^{\lambda_2 x}$. Provided that the
roots are distinct, the general solution of the differential equation is

$$y(x) = Ce^{\lambda_1 x} + De^{\lambda_2 x}, \tag{22}$$


where C and D are arbitrary constants. 


The roots of the auxiliary equation are

$$\lambda_1 = \frac{-b + \sqrt{b^2 - 4ac}}{2a} \quad\text{and}\quad \lambda_2 = \frac{-b - \sqrt{b^2 - 4ac}}{2a}. \tag{23}$$

It does not matter which of the roots is called $\lambda_1$ and which is called $\lambda_2$.

Assuming that the coefficients a, b and c are real, there are three cases to
consider, depending on the sign of the discriminant $b^2 - 4ac$:

• For $b^2 - 4ac > 0$, the roots are distinct and real.
• For $b^2 - 4ac < 0$, the roots are distinct and complex.
• For $b^2 - 4ac = 0$, the roots are equal and real.

We consider each of these cases in turn.



Distinct real roots 


Example 3 

(a) Write down the auxiliary equation of the differential equation

$$\frac{d^2y}{dx^2} - 3\frac{dy}{dx} + 2y = 0,$$

and find its roots $\lambda_1$ and $\lambda_2$.


(b) Use Theorem 1 to write down the general solution of the differential 
equation, and verify that your answer does satisfy the differential 
equation. 


Solution 


(a) The auxiliary equation is

$$\lambda^2 - 3\lambda + 2 = 0.$$

This equation may be solved, for example, by factorising it in the form

$$(\lambda - 1)(\lambda - 2) = 0,$$

to give the two roots $\lambda_1 = 1$ and $\lambda_2 = 2$. (Using the formula for the
roots of a quadratic produces the same answer.)

(b) Since $\lambda_1 = 1$ and $\lambda_2 = 2$ are the roots of the auxiliary equation, the
functions $y_1 = e^{x}$ and $y_2 = e^{2x}$ are solutions of the differential
equation. Theorem 1 then shows that the general solution of the
differential equation is

$$y(x) = C y_1(x) + D y_2(x) = Ce^{x} + De^{2x},$$

where C and D are arbitrary constants.

To check that this function satisfies the differential equation, we
differentiate it and then differentiate again, to get

$$\frac{dy}{dx} = Ce^{x} + 2De^{2x} \quad\text{and}\quad \frac{d^2y}{dx^2} = Ce^{x} + 4De^{2x}.$$

Substituting into the left-hand side of the differential equation then
gives

$$\begin{aligned}
\frac{d^2y}{dx^2} - 3\frac{dy}{dx} + 2y
&= (Ce^{x} + 4De^{2x}) - 3(Ce^{x} + 2De^{2x}) + 2(Ce^{x} + De^{2x}) \\
&= C(1 - 3 + 2)e^{x} + D(4 - 6 + 2)e^{2x} \\
&= 0,
\end{aligned}$$

as required. Hence $y = Ce^{x} + De^{2x}$ is a solution of the differential
equation, for all values of C and D.
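The same check can be made numerically. The sketch below approximates the derivatives by central differences rather than differentiating exactly, so the result is only zero to within a small tolerance; the helper names `residual` and `general_solution`, the step size h and the sample values of C, D and x are assumptions made for illustration.

```python
import math


def residual(y, x, a, b, c, h=1e-4):
    """Approximate a*y'' + b*y' + c*y at x using central differences."""
    dy = (y(x + h) - y(x - h)) / (2 * h)
    d2y = (y(x + h) - 2 * y(x) + y(x - h)) / h**2
    return a * d2y + b * dy + c * y(x)


def general_solution(C, D):
    """y(x) = C e^x + D e^{2x}, the general solution found in Example 3."""
    return lambda x: C * math.exp(x) + D * math.exp(2 * x)


# Several choices of the arbitrary constants, each tested at several points
for C, D in [(1.0, 0.0), (0.0, 1.0), (2.5, -1.5)]:
    y = general_solution(C, D)
    for x in (-1.0, 0.0, 0.5):
        assert abs(residual(y, x, a=1, b=-3, c=2)) < 1e-4
```

A function that is not a solution, such as $e^{3x}$, gives a residual that is clearly non-zero, which is what makes the check informative.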




Exercise 3 


Use the auxiliary equation and Theorem 1 to find the general solution of 
each of the following differential equations. 

dy dy d?y dy d?z 

— +5— + by =0 b) 2-53 +3— =0 —, —4z=0 
(a) Fa ths + by (b) 255+ (c) Ga 4 


Distinct complex conjugate roots 


When the discriminant $b^2 - 4ac$ is negative, equations (23) produce two
complex roots that are complex conjugates of one another:

$$\lambda_1 = \alpha + \beta i \quad\text{and}\quad \lambda_2 = \alpha - \beta i,$$

where $\alpha$ and $\beta$ are real. (Recall that the complex conjugate of $\alpha + \beta i$
is $\alpha - \beta i$.) The corresponding functions $y_1(x) = Ae^{\lambda_1 x}$ and
$y_2(x) = Be^{\lambda_2 x}$ satisfy the differential equation, and the general solution
takes the form

$$y(x) = Ae^{\lambda_1 x} + Be^{\lambda_2 x} = Ae^{(\alpha + \beta i)x} + Be^{(\alpha - \beta i)x}, \tag{24}$$

where A and B are arbitrary constants (which may be complex in this
case). We usually need a real-valued solution, so it is best to express our
result without using complex numbers. This can be achieved by first
writing equation (24) as

$$\begin{aligned}
y(x) &= Ae^{(\alpha + \beta i)x} + Be^{(\alpha - \beta i)x} \\
&= Ae^{\alpha x}e^{i\beta x} + Be^{\alpha x}e^{-i\beta x} \\
&= e^{\alpha x}\left(Ae^{i\beta x} + Be^{-i\beta x}\right).
\end{aligned}$$

We can simplify this by using Euler’s formula. Following the same
argument as that which led to equation (18), but with $\beta x$ in place of $\omega t$,
we conclude that

$$y(x) = e^{\alpha x}\left(C\cos(\beta x) + D\sin(\beta x)\right), \tag{25}$$

where C = A + B and D = i(A − B) are arbitrary constants. If the
required solution is real-valued, then C and D are real-valued, and
equation (25) is the most convenient form of the general solution.


Example 4 

(a) Write down the auxiliary equation of the differential equation

$$\frac{d^2y}{dx^2} - 6\frac{dy}{dx} + 13y = 0,$$

and show that its roots are $\lambda_1 = 3 + 2i$ and $\lambda_2 = 3 - 2i$.

(b) Hence write down the general solution of the differential equation in
terms of sines and cosines.




Solution

(a) The auxiliary equation is

$$\lambda^2 - 6\lambda + 13 = 0.$$

The standard formula for the roots of a quadratic gives

$$\lambda = \frac{6 \pm \sqrt{36 - 4 \times 1 \times 13}}{2} = \frac{6 \pm \sqrt{-16}}{2} = 3 \pm 2i,$$

so the two complex conjugate roots are $\lambda_1 = 3 + 2i$ and $\lambda_2 = 3 - 2i$.

(b) The roots are $\alpha \pm \beta i$, where $\alpha = 3$ and $\beta = 2$, so using equation (25),
the general solution of the differential equation is

$$y(x) = e^{3x}(C\cos 2x + D\sin 2x),$$

where C and D are arbitrary constants.


Exercise 4 


Use the auxiliary equation and Theorem 1 to find the general solution of 
each of the following differential equations. 

dy dy a6 

—, +4— + 8y =0 b) —+90=0 
ge ge (>) Get 


Equal roots 


There is a special case where the method based on Theorem 1 does not 
work. This is when the two roots of the auxiliary equation are equal. 


To see what the problem is, suppose that we have two solutions $y_1(x)$ and
$y_2(x)$ of a homogeneous linear second-order differential equation. Then the
principle of superposition tells us that any linear combination

$$y(x) = C y_1(x) + D y_2(x) \tag{26}$$

is also a solution, and we have argued that this must be the general
solution because it contains two arbitrary constants. But suppose that the
functions $y_1(x)$ and $y_2(x)$ are constant multiples of one another, so that
$y_2(x) = k\,y_1(x)$ for all x, where k is a constant. In this case, we can rewrite
equation (26) as

$$y(x) = C y_1(x) + Dk\,y_1(x) = (C + kD)\,y_1(x),$$

which shows that there is really only one arbitrary constant, A = C + kD,
in this case.


If two functions are constant multiples of one another, then they are said
to be linearly dependent. In order for equation (26) to be the general
solution of the homogeneous linear second-order differential equation, the
functions $y_1$ and $y_2$ must not be constant multiples of one another; we say
that they must be linearly independent solutions.




If both the roots of the auxiliary equation are equal to $\lambda$, then the function
$y_1(x) = e^{\lambda x}$ provides one solution of the differential equation, but we still
need another linearly independent solution in order to construct the
general solution. Fortunately, there is a simple way of finding this extra
solution, illustrated by the following example.


Example 5 
(a) Write down the auxiliary equation of the differential equation

$$\frac{d^2y}{dx^2} + 6\frac{dy}{dx} + 9y = 0,$$

and find its roots.

(b) Deduce that $y_1 = e^{-3x}$ is a solution of the differential equation.

(c) By substituting into the differential equation, show that $y_2 = xe^{-3x}$ is
also a solution.

(d) Deduce that $y = (C + Dx)e^{-3x}$ is a solution of the differential
equation for any values of the constants C and D. Is this the general
solution of the differential equation?


Solution 


(a) The auxiliary equation is

$$\lambda^2 + 6\lambda + 9 = 0.$$

The left-hand side is the perfect square $(\lambda + 3)^2$, so the auxiliary
equation has equal roots $\lambda_1 = \lambda_2 = -3$.

(b) Since $\lambda_1 = -3$ is a root of the auxiliary equation, $y_1 = e^{-3x}$ is a
solution of the differential equation.

(c) To show that $y_2 = xe^{-3x}$ is a solution of the differential equation, we
differentiate it twice and substitute into the differential equation.
Differentiation (using the product rule) gives

$$\frac{dy_2}{dx} = e^{-3x} + x(-3e^{-3x}) = (1 - 3x)e^{-3x},$$

$$\frac{d^2y_2}{dx^2} = -3e^{-3x} + (1 - 3x)(-3e^{-3x}) = (-6 + 9x)e^{-3x}.$$

Substituting these into the left-hand side of the differential equation
then gives

$$\begin{aligned}
\frac{d^2y_2}{dx^2} + 6\frac{dy_2}{dx} + 9y_2
&= (-6 + 9x)e^{-3x} + 6(1 - 3x)e^{-3x} + 9xe^{-3x} \\
&= (-6 + 6)e^{-3x} + (9 - 18 + 9)xe^{-3x} \\
&= 0.
\end{aligned}$$

Hence $y_2 = xe^{-3x}$ is a solution of the differential equation.

(d) Since $y_1 = e^{-3x}$ and $y_2 = xe^{-3x}$ are both solutions of the same
homogeneous differential equation, the principle of superposition tells
us that

$$y = Ce^{-3x} + Dxe^{-3x} = (C + Dx)e^{-3x}$$

is also a solution of this equation for any values of C and D.

This solution contains two arbitrary constants C and D that cannot
be combined because the functions $e^{-3x}$ and $xe^{-3x}$ are linearly
independent (i.e. they are not constant multiples of one another). It is
therefore the general solution of the differential equation.


If you want to prove this, you can substitute $y = xe^{\lambda x}$ into the
left-hand side of equation (19), and use the fact that $\lambda$ satisfies
the auxiliary equation, with $\lambda = -b/2a$ for equal roots.


The method used in the above example always works when the auxiliary
equation has equal roots. (The proof follows the same method as the
example, but using symbols instead of numbers.) Thus whenever the
auxiliary equation has equal roots $\lambda_1 = \lambda_2$, the general solution of the
homogeneous equation is

$$y(x) = (C + Dx)e^{\lambda_1 x}, \tag{27}$$


where C and D are arbitrary constants. 
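The proof mentioned above can be sketched in symbols as follows. This is a reconstruction of the argument rather than a quotation from the text: it writes $\lambda$ for the repeated root, substitutes $y = xe^{\lambda x}$ into the left-hand side of equation (19), and uses the product rule.

```latex
\begin{align*}
y &= x e^{\lambda x}, \qquad
\frac{dy}{dx} = (1 + \lambda x)\,e^{\lambda x}, \qquad
\frac{d^2y}{dx^2} = (2\lambda + \lambda^2 x)\,e^{\lambda x}, \\[4pt]
a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy
  &= \bigl[(a\lambda^2 + b\lambda + c)\,x + (2a\lambda + b)\bigr]\,e^{\lambda x}.
\end{align*}
```

The first bracketed term vanishes because $\lambda$ satisfies the auxiliary equation, and the second vanishes because $\lambda = -b/2a$ when the roots are equal, so $xe^{\lambda x}$ is indeed a solution.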


Exercise 5 


Use the auxiliary equation method to find the general solution of the 
following differential equations. 

dy dy - ‘ 
(a) qm +2g, t¥=9 (b) §—46+4s =0 


2.3 General procedure and further practice 


We have now considered all the cases that can arise when solving a 

homogeneous linear second-order differential equation with constant 
coefficients. This subsection summarises the method of solution as a 
procedure, and gives exercises for further practice. 


Procedure 1 General solution of a homogeneous linear 
constant-coefficient second-order differential equation 


The general solution of the homogeneous linear constant-coefficient
second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0, \tag{Eq. 19}$$

where a, b, c are real constants with $a \neq 0$, may be found as follows.

1. Write down the auxiliary equation

$$a\lambda^2 + b\lambda + c = 0, \tag{Eq. 20}$$

and find its roots $\lambda_1$ and $\lambda_2$.

2. (a) If the auxiliary equation has two distinct real roots $\lambda_1$
and $\lambda_2$, then the general solution is

$$y = Ce^{\lambda_1 x} + De^{\lambda_2 x}. \tag{Eq. 22}$$

(b) If the auxiliary equation has a pair of complex conjugate
roots $\lambda_1 = \alpha + \beta i$ and $\lambda_2 = \alpha - \beta i$, then the general solution
is

$$y = e^{\alpha x}(C\cos\beta x + D\sin\beta x). \tag{Eq. 25}$$

(c) If the auxiliary equation has two equal real roots $\lambda_1 = \lambda_2$,
then the general solution is

$$y = (C + Dx)e^{\lambda_1 x}. \tag{Eq. 27}$$

In each of these cases, C and D are arbitrary constants.
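Procedure 1 translates directly into a short program. The following Python sketch is one possible rendering, using only the standard `math` and `cmath` modules; the function names `classify_roots` and `general_solution` are hypothetical. The equal-roots case is detected by an exact comparison with zero, which is reliable for integer coefficients such as those in the examples but may need a tolerance for floating-point data.

```python
import cmath
import math


def classify_roots(a, b, c):
    """Return (case, lambda1, lambda2) for a*lam**2 + b*lam + c = 0, a != 0.

    case is 'distinct real', 'complex conjugate' or 'equal real'.
    """
    if a == 0:
        raise ValueError("a must be non-zero for a second-order equation")
    disc = b * b - 4 * a * c
    if disc > 0:
        root = math.sqrt(disc)
        return "distinct real", (-b + root) / (2 * a), (-b - root) / (2 * a)
    if disc < 0:
        root = cmath.sqrt(disc)  # purely imaginary square root
        return "complex conjugate", (-b + root) / (2 * a), (-b - root) / (2 * a)
    return "equal real", -b / (2 * a), -b / (2 * a)


def general_solution(a, b, c):
    """Return a function y(x, C, D) built according to Procedure 1."""
    case, lam1, lam2 = classify_roots(a, b, c)
    if case == "distinct real":
        return lambda x, C, D: C * math.exp(lam1 * x) + D * math.exp(lam2 * x)
    if case == "complex conjugate":
        alpha, beta = lam1.real, lam1.imag
        return lambda x, C, D: math.exp(alpha * x) * (
            C * math.cos(beta * x) + D * math.sin(beta * x)
        )
    return lambda x, C, D: (C + D * x) * math.exp(lam1 * x)


print(classify_roots(1, -3, 2))   # coefficients of Example 3
print(classify_roots(1, -6, 13))  # coefficients of Example 4
print(classify_roots(1, 6, 9))    # coefficients of Example 5
```

Returning the solution as a function of x, C and D keeps the arbitrary constants explicit, so they can be fixed later from initial conditions.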


Exercise 6 


Use Procedure 1 to find the general solution of each of the following 
differential equations. 


d?y d*y 
(a) Fa t4y=0 (b) Ta 9 =0 

dy dy dy dy 

ilk I, pt ee oe Ee thay ee 
Getta ae 2 ee 


a d 
(ec) Sa t45+29y=0 — (f) u"(x) — 6u'(x) + 8u(x) = 0 


Exercise 7 


(a) Write down the auxiliary equation of the differential equation 


(b) Solve this auxiliary equation, and write down the general solution of 
the differential equation. 


Exercise 8 


Find the general solution of each of the following differential equations. 


d*y dy d?y 

dy dy d20 dg 

7% 4 Ay = 4 a a 
aa" ae ag 




Exercise 9 
For which values of the constant k does the differential equation
dy dy 
Ap? 4 Ay = 0 
dat de 7 MY 
have a general solution with oscillating behaviour, that is, a general 
solution which involves sines and cosines? 


2.4 Damped harmonic oscillators 


Apart from equations (31) and (32), and the definitions of amplitude, 
phase constant, angular frequency and period, this subsection contains 
no new mathematical ideas, so if you are short of time, you may 
choose to read it quickly. However, many students find it invaluable 
to think about the solutions of differential equations in the context of 
real oscillating systems. 



We consider again the damped harmonic oscillator that was discussed in
the Introduction. This system was illustrated in Figure 1. It consists of an
object of mass m that moves along the x-axis, with position x(t) at time t.
A spring exerts a force −kx on the object, pulling it towards the
equilibrium position x = 0. The object also experiences a damping (or
frictional) force that is taken to be proportional to the object’s velocity
dx/dt. Under these circumstances, the object obeys the equation of motion
given in equation (6). This is the homogeneous linear constant-coefficient
differential equation

$$m\frac{d^2x}{dt^2} + \gamma\frac{dx}{dt} + kx = 0, \tag{28}$$

where m, k and $\gamma$ are positive constants. The damped harmonic oscillator
is a good model for vehicle suspension systems, but equations like (28)
appear in many other contexts where oscillations occur.

Let us begin with the special case of no damping ($\gamma = 0$). Equation (28)
then reduces to the equation of motion of a simple harmonic oscillator

$$\frac{d^2x}{dt^2} + \omega^2 x = 0,$$

where $\omega = \sqrt{k/m}$. You saw earlier that the general solution of this
equation is

$$x(t) = C\cos(\omega t) + D\sin(\omega t), \tag{Eq. 18}$$


where C and D are arbitrary constants. 




To interpret this solution, it is helpful to note that it can also be written as

$$x(t) = A\sin(\omega t + \phi), \tag{29}$$

where A and $\phi$ are arbitrary constants. To see why this works, expand the
right-hand side of equation (29), to get

$$x(t) = A\sin(\omega t)\cos\phi + A\cos(\omega t)\sin\phi.$$

Then comparing with equation (18), we see that equation (29) is valid
provided that

$$C = A\sin\phi \quad\text{and}\quad D = A\cos\phi. \tag{30}$$

Squaring and adding these equations gives

$$C^2 + D^2 = A^2(\sin^2\phi + \cos^2\phi) = A^2,$$

so

$$A = \sqrt{C^2 + D^2}. \tag{31}$$

Dividing the first equation in (30) by the second, we also have

$$\frac{C}{D} = \frac{A\sin\phi}{A\cos\phi} = \tan\phi,$$

so

$$\phi = \arctan(C/D), \tag{32}$$

with the quadrant of $\phi$ fixed by the signs of C and D in equations (30).
In equation (31) we have chosen the positive square root for A; this
involves no loss of generality because $\sin(\omega t + \pi) = -\sin(\omega t)$, so increasing
the value of $\phi$ by $\pi$ is equivalent to reversing the sign of A. Values of $\phi$
that differ by an integer multiple of $2\pi$ correspond to the same motion, so
we can restrict $\phi$ to a range such as $0 \le \phi < 2\pi$.
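Equations (31) and (32) can be applied in code as follows. This sketch uses the two-argument function `math.atan2` rather than a plain arctangent, because `atan2` takes the signs of C and D into account and so returns a phase in the correct quadrant for A > 0; that choice, and the function name `amplitude_phase`, are assumptions of this example rather than part of the text.

```python
import math


def amplitude_phase(C, D):
    """Return (A, phi) such that C*cos(w*t) + D*sin(w*t) = A*sin(w*t + phi).

    A is non-negative and phi is reduced to the range from 0 up to 2*pi,
    so that C = A*sin(phi) and D = A*cos(phi), as in equations (30).
    """
    A = math.hypot(C, D)                    # equation (31)
    phi = math.atan2(C, D) % (2 * math.pi)  # quadrant-aware form of (32)
    return A, phi


omega = 2.0
for C, D in [(1.0, 1.0), (1.0, -1.0), (-1.0, -1.0), (-1.0, 1.0)]:
    A, phi = amplitude_phase(C, D)
    for t in (0.0, 0.3, 1.1):
        lhs = C * math.cos(omega * t) + D * math.sin(omega * t)
        rhs = A * math.sin(omega * t + phi)
        assert abs(lhs - rhs) < 1e-12
```

For example, C = 3 and D = −4 give A = 5, whereas arctan(C/D) alone would return a negative angle whose sine has the wrong sign for equations (30).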


Figure 4 shows a graph of the solution.


Figure 4 Simple harmonic motion with amplitude A, phase constant $\phi$
and period T


Recall the trigonometric identity

$$\sin(A + B) = \sin A\cos B + \cos A\sin B.$$




$\omega$ is sometimes called the
natural angular frequency.




The constant A > 0 is the amplitude of the oscillation, which is the
magnitude of the maximum displacement from the equilibrium position
x = 0. The constant $\phi$ is called the phase constant. This is related to
the time when the oscillator passes through the equilibrium position:
according to equation (29), x = 0 at time $t = -\phi/\omega$. The oscillation
consists of identical cycles, endlessly repeated. The time taken to complete
one of these cycles is the period of the oscillation, and is given the
symbol T. Because the sine function has period $2\pi$, we have

$$\omega T = 2\pi, \quad\text{so}\quad T = \frac{2\pi}{\omega} = 2\pi\sqrt{\frac{m}{k}}.$$

As you might expect, the period is reduced if m is reduced (a lighter
particle) or if k is increased (a stiffer spring). The quantity 1/T is called
the frequency of the oscillation, and represents the number of cycles
completed per unit time. The constant $\omega = 2\pi/T$ is called the angular
frequency.


Now let us return to the case where damping is present ($\gamma > 0$). In this
case, the relevant homogeneous differential equation is (28), which can be
written more simply as

$$\frac{d^2x}{dt^2} + 2\Gamma\frac{dx}{dt} + \omega^2 x = 0, \tag{33}$$

where the damping parameter $\Gamma = \gamma/(2m)$ is a measure of the damping,
and $\omega = \sqrt{k/m}$ is the angular frequency of the corresponding undamped
oscillator. The corresponding auxiliary equation is

$$\lambda^2 + 2\Gamma\lambda + \omega^2 = 0, \tag{34}$$

and this has solutions

$$\lambda = -\Gamma \pm \sqrt{\Gamma^2 - \omega^2}. \tag{35}$$

Depending on the value of the discriminant $\Gamma^2 - \omega^2$, there are three
different types of solution, which correspond to three different types of
motion. These exemplify the three cases described in Subsection 2.2:
distinct complex roots, distinct real roots and equal roots.
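The three cases can be distinguished in code by the sign of the discriminant. The sketch below is illustrative only: the function names `oscillator_parameters` and `damping_regime` are hypothetical, and the critically damped case is detected by exact equality, which in practice would usually be replaced by a tolerance.

```python
import cmath
import math


def oscillator_parameters(m, gamma, k):
    """Return (Gamma, omega) with Gamma = gamma/(2m) and omega = sqrt(k/m)."""
    return gamma / (2 * m), math.sqrt(k / m)


def damping_regime(Gamma, omega):
    """Classify x'' + 2*Gamma*x' + omega**2 * x = 0.

    Returns the regime name together with the two roots of the auxiliary
    equation (34), computed from equation (35) as complex numbers.
    """
    disc = Gamma**2 - omega**2
    root = cmath.sqrt(disc)
    lam1, lam2 = -Gamma - root, -Gamma + root
    if disc < 0:
        regime = "underdamped"
    elif disc > 0:
        regime = "overdamped"
    else:
        regime = "critically damped"
    return regime, lam1, lam2


# Illustrative physical values (not taken from the text)
Gamma, omega = oscillator_parameters(m=2.0, gamma=1.0, k=8.0)
print(damping_regime(Gamma, omega)[0])
```

Using `cmath.sqrt` means that the same formula covers all three cases, with the roots having zero imaginary part when the motion is overdamped or critically damped.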


Underdamped motion: Γ < ω


When $\Gamma < \omega$, the discriminant $\Gamma^2 - \omega^2$ is negative, and the roots of the
auxiliary equation are complex numbers:

$$\lambda = -\Gamma \pm i\Omega, \quad\text{where}\quad \Omega = \sqrt{\omega^2 - \Gamma^2}. \tag{36}$$

The general solution is therefore

$$x(t) = e^{-\Gamma t}\left[C\cos(\Omega t) + D\sin(\Omega t)\right], \tag{37}$$

where C and D are arbitrary constants. Transforming the term in square
brackets, just as in equation (29), we can also write this as

$$x(t) = Ae^{-\Gamma t}\sin(\Omega t + \phi). \tag{38}$$


Figure 5 shows graphs of this function for different values of the arbitrary
constants A and $\phi$. (Section 4 will explain how these arbitrary constants
can be determined from given initial conditions.)


Figure 5 Two solutions for an underdamped harmonic oscillator with
$\omega = 1$ and $\Gamma = 1/8$, corresponding to A = 1, $\phi = 0$ (orange curve) and
A = 1, $\phi = \pi/2$ (purple curve)


This motion is called underdamped because the damping force is weak
enough for the motion of the particle to oscillate to and fro. The angular
frequency $\Omega$ of the damped oscillator is smaller than the angular
frequency $\omega$ of the corresponding undamped oscillator (see equation (36)).
Each cycle of the damped oscillator takes a period $T = 2\pi/\Omega$, which is
longer than the period $2\pi/\omega$ of the corresponding simple harmonic
oscillation, although this effect is slight if $\Gamma \ll \omega$. The amplitude of the
damped oscillations is $Ae^{-\Gamma t}$, which decreases exponentially with time. In
this context, $\Gamma$ may be called the decay constant. If $\Gamma$ approaches zero,
the decay constant approaches zero and the angular frequency $\Omega$
approaches $\omega$, so in this limit the system behaves like a simple harmonic
oscillator.
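The quantities in equations (36) and (38) are easy to evaluate numerically. The following Python sketch is illustrative only: the function names `underdamped_parameters` and `displacement` are assumptions made for this example, and the sample values are those quoted for Figure 5.

```python
import math

def underdamped_parameters(damping, omega):
    """Return (Omega, period) for an underdamped oscillator (Gamma < omega)."""
    if not 0 <= damping < omega:
        raise ValueError("underdamped motion requires 0 <= Gamma < omega")
    big_omega = math.sqrt(omega**2 - damping**2)  # equation (36)
    period = 2 * math.pi / big_omega              # T = 2*pi/Omega
    return big_omega, period

def displacement(t, amplitude, phase, damping, omega):
    """Evaluate x(t) = A exp(-Gamma t) sin(Omega t + phi), equation (38)."""
    big_omega, _ = underdamped_parameters(damping, omega)
    return amplitude * math.exp(-damping * t) * math.sin(big_omega * t + phase)

# Values used for Figure 5: omega = 1, Gamma = 1/8.
big_omega, period = underdamped_parameters(damping=1 / 8, omega=1.0)
undamped_period = 2 * math.pi / 1.0
print(big_omega, period, undamped_period)
```

With these values the damped period comes out only slightly longer than the undamped one, which is the behaviour described above for small $\Gamma$.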


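Overdamped motion: $\Gamma > \omega$


When $\Gamma > \omega$, the discriminant $\Gamma^2 - \omega^2$ is positive, and the roots of the
auxiliary equation are real negative numbers:

$$\lambda_1 = -\Gamma - \sqrt{\Gamma^2 - \omega^2} \quad\text{and}\quad \lambda_2 = -\Gamma + \sqrt{\Gamma^2 - \omega^2}.$$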




Unit 3. Second-order differential equations 




In this case, the general solution of the homogeneous differential equation is

$$x(t) = Ce^{\lambda_1 t} + De^{\lambda_2 t}, \qquad (39)$$

where $C$ and $D$ are arbitrary constants. Figure 6 shows graphs of this
function for different values of the arbitrary constants $C$ and $D$.


Figure 6 Two solutions for an overdamped harmonic oscillator with
$\omega = 1/4$ and $\Gamma = 1/2$, corresponding to $C = 1$, $D = 0$ (orange curve) and
$C = 0$, $D = 1$ (purple curve)

Because $\lambda_1$ and $\lambda_2$ are both negative, the solution dies away exponentially,
and there are no oscillations. This motion is called overdamped because
the damping force is strong enough to prevent oscillations. When $\Gamma$
approaches $\omega$ from above, both roots approach the value $-\Gamma$, and the
solution is proportional to $e^{-\Gamma t}$.


Critically damped motion: $\Gamma = \omega$


When $\Gamma = \omega$, the discriminant $\Gamma^2 - \omega^2$ is equal to zero. In this case the
general solution is

$$x(t) = (C + Dt)e^{-\Gamma t}, \qquad (40)$$

where $C$ and $D$ are arbitrary constants. Figure 7 shows graphs of this
function for different values of the arbitrary constants $C$ and $D$.


Figure 7 Two solutions for a critically damped harmonic oscillator with
$\omega = \Gamma = 1/2$, corresponding to $C = 1$, $D = 0$ (orange curve) and $C = 0$,
$D = 1$ (purple curve)

The motion is described as critically damped. A vehicle suspension
system is usually set up to be close to critical damping, so that it is soft
enough to move in response to a bumpy road, without allowing oscillations.
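The three cases can be told apart mechanically from the sign of the discriminant $\Gamma^2 - \omega^2$. The sketch below is a minimal illustration; the function name `classify_damping` is an assumption made here, and the exact comparison with zero is reliable only for values that are represented exactly in floating point.

```python
import math

def classify_damping(damping, omega):
    """Classify the motion and return the roots of
    lambda^2 + 2*Gamma*lambda + omega^2 = 0 (equation (34))."""
    disc = damping**2 - omega**2
    if disc < 0:
        # Complex roots -Gamma +/- i*Omega, equation (36).
        big_omega = math.sqrt(-disc)
        return "underdamped", (complex(-damping, big_omega),
                               complex(-damping, -big_omega))
    if disc > 0:
        # Two distinct real negative roots.
        root = math.sqrt(disc)
        return "overdamped", (-damping - root, -damping + root)
    # Equal roots: both are -Gamma.
    return "critically damped", (-damping, -damping)

# Parameter values taken from Figures 5, 6 and 7 respectively.
for gamma_value, omega_value in [(1 / 8, 1.0), (1 / 2, 1 / 4), (1 / 2, 1 / 2)]:
    print(gamma_value, omega_value, classify_damping(gamma_value, omega_value))
```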


Exercise 10 


Small oscillations of the pendulum shown in Figure 8 can be described by
the homogeneous differential equation

$$m\frac{d^2\theta}{dt^2} + \gamma\frac{d\theta}{dt} + \frac{mg}{l}\theta = 0,$$

where $m$ is the mass of the bob, $l$ is the length of the string, $g$ is the
magnitude of the acceleration due to gravity, and $\gamma$ is a damping constant.
With mass measured in kilograms, length in metres and time in seconds,
the values of these constants are $m = 0.80$, $l = 2.0$, $g = 9.8$ and $\gamma = 0.016$.


(a) Is this oscillation underdamped, overdamped or critically damped?
(b) What is the period of the oscillation?


(c) If the initial amplitude of the oscillation is $\theta = 0.20$ (in radians), what
is the amplitude at $t = 100$?

Figure 8 A pendulum


Exercise 11 


A critically damped harmonic oscillator is described by the differential
equation (28), with $m = 1$ and $k = 4$ (in suitable units). Determine the
value of $\Gamma$ in these units, and write down an expression for the general
solution $x(t)$.




The notation $y_c(x)$ and $y_p(x)$
(see below) will be explained
shortly.




3 Inhomogeneous differential 
equations 


3.1 General method of solution 


Section 2 focused on finding the general solution of homogeneous linear
constant-coefficient second-order differential equations. This section
explains how to find the general solution of inhomogeneous linear
constant-coefficient second-order differential equations, that is, equations
of the form

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x),$$

where $a$, $b$, $c$ are constants, with $a \neq 0$, and $f(x)$ is a given continuous
function of $x$. The basic method for finding the general solution of such an
equation depends on the principle of superposition, and is illustrated in the
following example.


Example 6 

Exercise 3(a) showed that the homogeneous equation

$$\frac{d^2y}{dx^2} + 5\frac{dy}{dx} + 6y = 0 \qquad (41)$$

has the general solution $y_c(x) = Ce^{-2x} + De^{-3x}$, where $C$ and $D$ are
arbitrary constants.

Use this fact, together with the principle of superposition, to show that the
inhomogeneous equation

$$\frac{d^2y}{dx^2} + 5\frac{dy}{dx} + 6y = 12 \qquad (42)$$

has a solution $y(x) = Ce^{-2x} + De^{-3x} + 2$ for any $C$ and $D$.
Solution 


We can show that the constant function $y_p(x) = 2$ is a particular solution
of the inhomogeneous equation by substituting it into the left-hand side.
This gives

$$\frac{d^2y_p}{dx^2} + 5\frac{dy_p}{dx} + 6y_p = 0 + 5 \times 0 + 6 \times 2 = 12,$$

which is the same as the right-hand side, as required.

We know that $y_c(x) = Ce^{-2x} + De^{-3x}$ is a solution of equation (41) for
any $C$ and $D$, and $y_p(x) = 2$ is a solution of equation (42). The principle
of superposition then tells us that $y_c(x) + y_p(x)$ is a solution of

$$\frac{d^2y}{dx^2} + 5\frac{dy}{dx} + 6y = 12 + 0 = 12,$$

which is identical to equation (42). Hence $y(x) = Ce^{-2x} + De^{-3x} + 2$ is a
solution of equation (42) for any $C$ and $D$.
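The conclusion of Example 6 can also be spot-checked numerically. The following Python sketch is only an illustration, not a proof: it uses the hand-computed derivatives of $y = Ce^{-2x} + De^{-3x} + 2$ and evaluates the left-hand side of equation (42) for a few arbitrarily chosen values of $C$, $D$ and $x$. The function names are assumptions made for this example.

```python
import math

def y(x, C, D):
    return C * math.exp(-2 * x) + D * math.exp(-3 * x) + 2

def dy(x, C, D):
    # First derivative of y with respect to x.
    return -2 * C * math.exp(-2 * x) - 3 * D * math.exp(-3 * x)

def d2y(x, C, D):
    # Second derivative of y with respect to x.
    return 4 * C * math.exp(-2 * x) + 9 * D * math.exp(-3 * x)

def left_hand_side(x, C, D):
    """Left-hand side of equation (42): y'' + 5y' + 6y."""
    return d2y(x, C, D) + 5 * dy(x, C, D) + 6 * y(x, C, D)

# Arbitrary sample constants and points; every value should be close to 12.
for C, D in [(0.0, 0.0), (1.0, -2.0), (3.5, 0.25)]:
    for x in [0.0, 0.5, 2.0]:
        print(C, D, x, left_hand_side(x, C, D))
```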


The solution found in the example above contains two arbitrary constants,
$C$ and $D$ (which appear as coefficients of two linearly independent
functions, $e^{-2x}$ and $e^{-3x}$). We expect the general solution of any
second-order differential equation to contain two independent arbitrary
constants, so $y(x) = Ce^{-2x} + De^{-3x} + 2$ is the general solution of
equation (42). We will now generalise this idea, but first it is helpful to
introduce some terminology.


Corresponding to the inhomogeneous equation (42), we have the
homogeneous equation (41), obtained by replacing the function on the
right-hand side by zero. This is called the associated homogeneous
equation. The solutions $y_c$ and $y_p$ also have special names in this context:
$y_c(x)$, the general solution of the associated homogeneous equation (41), is
called the complementary function, and $y_p(x)$, a particular solution of the
inhomogeneous equation (42), is called a particular integral.


Definitions 


Let

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x) \qquad (43)$$

be an inhomogeneous linear constant-coefficient second-order
differential equation.

• Its associated homogeneous equation is

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0. \qquad (44)$$

• The general solution $y_c(x)$ of the associated homogeneous
equation is known as the complementary function for the
original inhomogeneous equation (43).

• Any particular solution $y_p(x)$ of the original inhomogeneous
equation (43) is referred to as a particular integral for that
equation.

The term particular integral is used here, rather than the term
particular solution, which we reserve for a solution that
contains definite numbers rather than arbitrary constants (see
Section 4).

The particular integral is not unique: many different choices can be made,
but that does not matter. Suppose that $y_{p1}$ and $y_{p2}$ are two different
particular integrals for equation (43). Then the principle of superposition
tells us that $y_{p2} - y_{p1}$ is a solution of the equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x) - f(x) = 0,$$

which is the associated homogeneous equation (44). But we know that the
general solution of this equation is given by the complementary
function $y_c$, so it follows that $y_{p2} - y_{p1} = y_c$, or equivalently,

$$y_{p2} = y_c + y_{p1}.$$


See Exercise 4(b), although 
there different symbols are used 
for the variables. 


You will see in the next 
subsection how to find such a 
particular integral. 




Since any solution can be written in this form, it follows that $y_c + y_{p1}$ is
the general solution of the inhomogeneous equation (43). We have
therefore proved the following important result.


Theorem 2. General solution of an inhomogeneous equation 


The general solution of an inhomogeneous linear constant-coefficient
second-order differential equation is given by

$$y(x) = y_c(x) + y_p(x),$$

where $y_c(x)$ is the complementary function (the general solution of
the associated homogeneous equation) and $y_p(x)$ is a particular
integral of the inhomogeneous equation.


Note that $y_c(x)$, being the general solution of the associated homogeneous
equation, will contain two arbitrary constants, whereas $y_p(x)$, being a
particular solution of equation (43), will contain none. You have already
seen how to find the complementary function $y_c(x)$ in Subsection 2.2.
There is no general recipe for finding a particular integral $y_p(x)$, but there
are methods for ‘guessing’ a suitable solution, which work in most cases.
The following example illustrates the general technique.


Example 7 

Find the general solution of the differential equation

$$\frac{d^2y}{dx^2} + 9y = 9x + 9. \qquad (45)$$

Solution

The associated homogeneous equation is

$$\frac{d^2y}{dx^2} + 9y = 0,$$

which has the general solution

$$y_c = C\cos 3x + D\sin 3x,$$

where $C$ and $D$ are arbitrary constants. This is the complementary
function for equation (45).

A particular integral for equation (45) is

$$y_p = x + 1.$$

This may be verified by differentiation and substitution: $d^2y_p/dx^2 = 0$, so
substituting into the left-hand side of equation (45) gives

$$\frac{d^2y_p}{dx^2} + 9y_p = 0 + 9(x + 1) = 9x + 9,$$

which is the same as the right-hand side of equation (45), as required.




The general solution of equation (45) is therefore, by Theorem 2,

$$y = y_c + y_p = C\cos 3x + D\sin 3x + x + 1,$$

where $C$ and $D$ are arbitrary constants.


The method of Example 7 may be summarised as follows. 


Procedure 2 General solution of an inhomogeneous linear 
constant-coefficient second-order differential equation 


The general solution of the inhomogeneous linear
constant-coefficient second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x)$$

is found as follows.

1. First find the complementary function $y_c(x)$, i.e. the general
   solution of the associated homogeneous equation

   $$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = 0,$$

   using Procedure 1.
2. Then find a particular integral $y_p(x)$.
3. The general solution is then $y(x) = y_c(x) + y_p(x)$.

The reason why $y_c$ is found first will become clear in
Subsection 3.3.


It is worth noting that, by Theorem 2, any choice of particular integral in
Procedure 2 gives the same general solution. Formulas obtained for the
general solution may look different for different choices of particular
integral, but they are in fact always equivalent. For example, in Example 7
the particular integral $y_p = x + 1$ was chosen, and the general solution was
obtained as $y = C\cos 3x + D\sin 3x + x + 1$. It would have been equally
valid to have chosen $y_p = x + 1 + \sin 3x$ as the particular integral. In that
case, the general solution would have been obtained as
$y = C\cos 3x + D\sin 3x + x + 1 + \sin 3x$. This form looks a little different,
but it may be written as $y = C\cos 3x + (D + 1)\sin 3x + x + 1$; and since $C$
and $D$ are arbitrary constants, this form of the general solution represents
exactly the same family of solutions.
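The equivalence of the two forms can be illustrated numerically. In the sketch below (the function name `general_solution` and the sample constants are assumptions chosen for illustration), the solution built from $y_p = x + 1 + \sin 3x$ with constants $(C, D)$ is compared with the solution built from $y_p = x + 1$ with constants $(C, D + 1)$.

```python
import math

def general_solution(x, C, D, extra_sine=False):
    """C cos 3x + D sin 3x + y_p, with y_p = x + 1 or y_p = x + 1 + sin 3x."""
    y_p = x + 1 + (math.sin(3 * x) if extra_sine else 0.0)
    return C * math.cos(3 * x) + D * math.sin(3 * x) + y_p

C, D = 0.7, -1.3  # arbitrary sample constants
differences = []
for x in [0.0, 0.4, 1.0, 2.5]:
    with_extra_sine = general_solution(x, C, D, extra_sine=True)
    shifted_constant = general_solution(x, C, D + 1, extra_sine=False)
    differences.append(abs(with_extra_sine - shifted_constant))
print(differences)
```

The differences are zero up to rounding error, reflecting the fact that the extra $\sin 3x$ term is simply absorbed into the arbitrary constant $D$.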


Exercise 12 
For each of the following differential equations:

• Write down its associated homogeneous equation and its
complementary function $y_c$.
• Find a particular integral of the form $y_p = p$, where $p$ is a constant.
• Write down the general solution of the differential equation.

The complementary functions can be found from Exercise 6(a)
and an earlier example.


d?y d’y dy 




In Exercise 12, where the right-hand sides of the equations are constants, 
it is possible to find a particular integral almost ‘by inspection’; but this 
method is generally inadequate. Fortunately, there exist procedures for 
finding a particular integral for equations involving wide classes of 
right-hand-side functions $f(x)$. The remainder of this section considers
some of the simpler cases, where it is possible to determine the general 
form of a particular integral by inspection, although some manipulation is 
needed to determine the values of certain coefficients. 


3.2 The method of undetermined coefficients 


You have seen that the linear constant-coefficient second-order differential
equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x) \qquad \text{(Eq. 43)}$$

has the general solution $y(x) = y_c(x) + y_p(x)$, where the complementary
function $y_c(x)$ is found by solving the associated homogeneous equation,
using Procedure 1. However, the methods for finding a particular integral
$y_p(x)$ are another matter.


We can proceed by guessing that $y_p(x)$ takes some general form, called a
trial solution, which involves one or more constants (or coefficients)
whose values are initially undetermined. Then, by substituting the trial
solution into the differential equation, we can hope to find the values of
these coefficients, and hence find a particular integral. This is called the
method of undetermined coefficients.


You saw an example of this method in Exercise 12. There the function
$f(x)$ on the right-hand side of the differential equation is a constant, and
the trial solution is taken to be of the form $y_p = p$, where $p$ is an unknown
constant, whose value is determined by substituting into the differential
equation.


The choice of trial solution depends on the function $f(x)$ on the right-hand
side of equation (43). We will look at three cases:


• polynomial functions
• functions with exponential behaviour
• sinusoidal functions.


When we have considered examples for each case, we will gather 
everything together as a general procedure. Bear in mind, though, that the 
method finds only a particular integral for the differential equation; to find 
the general solution you also need to find the complementary function and 
add this to the particular integral, according to Procedure 2. 




Polynomial functions 
If $f(x)$ is a given polynomial, we have

$$f(x) = m_n x^n + m_{n-1} x^{n-1} + \cdots + m_1 x + m_0,$$

where $m_0, m_1, \ldots, m_n$ are given constants. This class of functions includes
constant functions ($n = 0$), linear functions ($n = 1$), quadratic functions
($n = 2$) and higher-order polynomials ($n \geq 3$).


Let us start by considering the case where f(z) is a linear function 
(a polynomial of degree 1). 


Example 8 
Find a particular integral for

$$3\frac{d^2y}{dx^2} - 2\frac{dy}{dx} + y = 4x + 2.$$

Solution

We try a solution of the form

$$y = p_1 x + p_0,$$

where $p_1$ and $p_0$ are coefficients to be determined so that the differential
equation is satisfied. To try this solution, we need the first and second
derivatives of $y$:

$$\frac{dy}{dx} = p_1, \qquad \frac{d^2y}{dx^2} = 0.$$

Substituting these into the left-hand side of the differential equation gives

$$3\frac{d^2y}{dx^2} - 2\frac{dy}{dx} + y = 3 \times 0 - 2p_1 + (p_1 x + p_0) = p_1 x + (p_0 - 2p_1).$$

For $y = p_1 x + p_0$ to be a solution of the differential equation, we require
that

$$p_1 x + (p_0 - 2p_1) = 4x + 2 \quad\text{for all } x. \qquad (46)$$

To find the two unknown coefficients $p_1$ and $p_0$, we compare the
coefficients on both sides of equation (46). Comparing the terms in $x$ gives
$p_1 = 4$. Comparing the constant terms gives $p_0 - 2p_1 = 2$, so
$p_0 = 2 + 2p_1 = 2 + 2 \times 4 = 10$. Therefore we have the particular integral

$$y_p = 4x + 10.$$

Check: If $y_p = 4x + 10$, then $dy_p/dx = 4$, $d^2y_p/dx^2 = 0$, and substituting
into the left-hand side of the differential equation gives

$$3\frac{d^2y_p}{dx^2} - 2\frac{dy_p}{dx} + y_p = 3 \times 0 - 2 \times 4 + (4x + 10) = 4x + 2,$$

as required.
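The coefficient comparison in Example 8 follows a fixed pattern for any linear right-hand side. Substituting $y = p_1 x + p_0$ into $a y'' + b y' + c y = m_1 x + m_0$ gives $c p_1 = m_1$ and $b p_1 + c p_0 = m_0$. The Python sketch below encodes these two equations; the function name is an assumption made for this illustration, and it assumes $c \neq 0$ (the case $c = 0$ is one where the simple trial solution does not work).

```python
def linear_particular_integral(a, b, c, m1, m0):
    """Return (p1, p0) such that y = p1*x + p0 satisfies
    a*y'' + b*y' + c*y = m1*x + m0. Requires c != 0.

    The coefficient a does not affect the result, because y'' = 0
    for a linear trial solution."""
    if c == 0:
        raise ValueError("c = 0: the linear trial solution does not work")
    p1 = m1 / c             # compare coefficients of x
    p0 = (m0 - b * p1) / c  # compare constant terms
    return p1, p0

# Example 8: 3y'' - 2y' + y = 4x + 2.
p1, p0 = linear_particular_integral(a=3, b=-2, c=1, m1=4, m0=2)
print(p1, p0)
```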


For simplicity, we denote the 
trial solution by y, with no 
subscript p. 


Notice that we can get two 
separate bits of information 
from the same equation because 
it applies for all x. 




You saw examples of this in 
Exercise 12. 




In the example above, the target function $f(x) = 4x + 2$ was a linear
function, and the trial solution $p_1 x + p_0$ was also a linear function. When
this trial solution was substituted into the left-hand side of the differential
equation, it produced another linear function, $p_1 x + (p_0 - 2p_1)$, whose
coefficients could be compared with those of the target function.


This is really the key to the method. The idea is to choose a trial solution
that includes undetermined constants which, when substituted into the
left-hand side of the differential equation, generates a function that can be
compared directly with the target function $f(x)$, allowing the constants to
be found.


This generally means that the trial solution should be chosen to belong to
the same class of functions as $f(x)$ on the right-hand side of the equation.
However, we must choose a trial solution that is general enough, so that its
first and second derivatives also belong to the same class as the trial
solution itself. This is illustrated in the following exercise.


Exercise 13 
Use trial solutions of the form $y = p_1 x + p_0$ to find particular integrals for
each of the following differential equations.


dy dy d’y dy 
apes =) Ag Ae eee, 
(a) dx? ee + 2y oe (b) dx? 7" dx ue . 


Note that the trial solution $y = p_1 x + p_0$ was suggested in Exercise 13(b),
even though $f(x)$ is just a multiple of $x$ and contains no constant term. In
fact, a trial solution of the form $p_1 x$ would not work in this case. The
reason is that we need to take the first and second derivatives of the trial
solution and substitute them into the differential equation. These
derivatives must belong to the class encompassed by the trial solution. In
this case, the first derivative of $p_1 x$ is the constant function $p_1$, so the trial
solution must contain a constant term.


In general, the following advice can be given: 


• If $f(x) = m_0$ is a constant function, then you should use a constant
trial solution of the form $y = p_0$.


• If $f(x) = m_1 x + m_0$ is a linear function, then you should use a linear
trial solution of the form $y = p_1 x + p_0$. Even if $m_0 = 0$, you should not
initially assume that $p_0 = 0$.


• More generally, if $f(x) = m_n x^n + m_{n-1} x^{n-1} + \cdots + m_1 x + m_0$, where
$m_n \neq 0$, then you should use a trial solution of the form
$y = p_n x^n + p_{n-1} x^{n-1} + \cdots + p_1 x + p_0$. Even if some of the coefficients
$m_0, m_1, \ldots, m_{n-1}$ are equal to zero, you should not initially assume
that any of the coefficients $p_0, p_1, \ldots, p_n$ are equal to zero.




Exercise 14 
Find a particular integral for 


y’ —y=e?. 


Functions with exponential behaviour 


Now let us suppose that the target function $f(x)$ on the right-hand side of
the inhomogeneous equation takes the form $f(x) = me^{kx}$, where $m$ and $k$
are constants. In general, such functions are not the exponential function,
but we can say that they exhibit exponential behaviour. The following
example shows how to find a particular integral in such a case.


Example 9 
Find a particular integral for

$$\frac{d^2y}{dx^2} + 9y = 2e^{3x}.$$

Solution

We try a solution of the form

$$y = pe^{3x},$$

where $p$ is an undetermined coefficient that can be found by requiring that
the differential equation is satisfied. Differentiating $y = pe^{3x}$ gives

$$\frac{dy}{dx} = 3pe^{3x}, \qquad \frac{d^2y}{dx^2} = 9pe^{3x}.$$

Substituting these into the left-hand side of the differential equation gives

$$\frac{d^2y}{dx^2} + 9y = 9pe^{3x} + 9pe^{3x} = 18pe^{3x}.$$

Therefore, for $y = pe^{3x}$ to be a solution of the differential equation, we
require that $18pe^{3x} = 2e^{3x}$ for all $x$. Hence $p = \frac{1}{9}$, and

$$y_p = \tfrac{1}{9}e^{3x}$$

is a particular integral for the given differential equation.


The general rule is as follows: when $f(x) = me^{kx}$, we use a trial solution of
the form $y(x) = pe^{kx}$, where $p$ is an undetermined constant. The value of $k$
in $y(x)$ is the same as in $f(x)$.
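For the general equation $a y'' + b y' + c y = me^{kx}$, substituting $y = pe^{kx}$ gives $p(ak^2 + bk + c)e^{kx} = me^{kx}$, so $p = m/(ak^2 + bk + c)$ whenever the denominator is non-zero. The Python sketch below applies this formula; the function name is an assumption made for illustration, and the zero-denominator branch corresponds to the situation in which $e^{kx}$ already solves the associated homogeneous equation.

```python
def exponential_particular_integral(a, b, c, m, k):
    """Return p such that y = p*exp(k*x) satisfies
    a*y'' + b*y' + c*y = m*exp(k*x)."""
    denominator = a * k**2 + b * k + c
    if denominator == 0:
        # exp(k*x) solves the homogeneous equation, so this trial form fails.
        raise ValueError("trial solution p*exp(k*x) does not work here")
    return m / denominator

# Example 9: y'' + 9y = 2 exp(3x), so a = 1, b = 0, c = 9, m = 2, k = 3.
p = exponential_particular_integral(a=1, b=0, c=9, m=2, k=3)
print(p)
```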


Exercise 15 
Find a particular integral for 
d?y 


dy 
god Oh ay = 26-2, 
dx? dx es 


Since the derivative of $e^{3x}$ is
$3e^{3x}$, the exponent ($3x$)
appearing in $y(x)$ should be the
same as that appearing in $f(x)$;
then only the coefficient $p$ is to
be determined.




Comparing coefficients works 
because the cos and sin functions 
are linearly independent. 




Sinusoidal functions 


Finally, let us suppose that the target function $f(x)$ on the right-hand side
of the inhomogeneous equation takes the form

$$f(x) = m\cos kx + n\sin kx,$$

where $m$, $n$ and $k$ are constants. Any such function is said to be
sinusoidal. In this case, the appropriate trial solution is

$$y(x) = p\cos kx + q\sin kx,$$

where $p$ and $q$ are undetermined constants. Following earlier ideas, the trial
solution must be general enough to be in the same class as its first and
second derivatives. So even if $f(x)$ contains only a sine or only a cosine,
the trial solution $y(x)$ must contain both a sine and a cosine. However, the
value of the constant $k$ in $y(x)$ should always be the same as that in $f(x)$.


Example 10 
Find a particular integral for

$$\frac{d^2y}{dx^2} + 2\frac{dy}{dx} + 2y = 10\sin 2x.$$

Solution

We try a solution of the form

$$y = p\cos 2x + q\sin 2x,$$

where $p$ and $q$ are coefficients whose values are to be found by substituting
into the differential equation. Differentiating $y$ gives

$$\frac{dy}{dx} = -2p\sin 2x + 2q\cos 2x, \qquad \frac{d^2y}{dx^2} = -4p\cos 2x - 4q\sin 2x.$$

Substituting these into the left-hand side of the differential equation gives

$$\frac{d^2y}{dx^2} + 2\frac{dy}{dx} + 2y = (-4p\cos 2x - 4q\sin 2x) + 2(-2p\sin 2x + 2q\cos 2x) + 2(p\cos 2x + q\sin 2x)$$

$$= (-2p + 4q)\cos 2x + (-4p - 2q)\sin 2x.$$

This can be equated to the right-hand side of the differential equation, so

$$(-2p + 4q)\cos 2x + (-4p - 2q)\sin 2x = 10\sin 2x \quad\text{for all } x. \qquad (47)$$

To find $p$ and $q$, we compare the coefficients of cos and sin on both sides of
equation (47). For this equation to be true for all $x$, we must have

$$-2p + 4q = 0 \quad\text{and}\quad -4p - 2q = 10.$$

Solving these simultaneous equations, we conclude that $p = -2$, $q = -1$, so

$$y_p(x) = -2\cos 2x - \sin 2x$$

is a particular integral for the given differential equation.
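The same comparison of coefficients can be carried out for a general sinusoidal right-hand side. Substituting $y = p\cos kx + q\sin kx$ into $a y'' + b y' + c y = m\cos kx + n\sin kx$ gives the pair of equations $(c - ak^2)p + bkq = m$ and $-bkp + (c - ak^2)q = n$. The Python sketch below solves this pair directly; the function name and the symbols `alpha` and `beta` are assumptions introduced for this illustration.

```python
def sinusoidal_particular_integral(a, b, c, m, n, k):
    """Return (p, q) such that y = p*cos(k*x) + q*sin(k*x) satisfies
    a*y'' + b*y' + c*y = m*cos(k*x) + n*sin(k*x)."""
    alpha = c - a * k**2   # multiplies p in the cos equation, q in the sin one
    beta = b * k           # cross-coupling produced by the first derivative
    determinant = alpha**2 + beta**2
    if determinant == 0:
        # cos(k*x) and sin(k*x) solve the homogeneous equation.
        raise ValueError("trial solution does not work here")
    p = (alpha * m - beta * n) / determinant
    q = (beta * m + alpha * n) / determinant
    return p, q

# Example 10: y'' + 2y' + 2y = 10 sin 2x, so m = 0, n = 10, k = 2.
p, q = sinusoidal_particular_integral(a=1, b=2, c=2, m=0, n=10, k=2)
print(p, q)
```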




Exercise 16 


Find a particular integral for 


d? d 
= _ — = cos 3t + sin 3t. 


The following procedure summarises the results of this subsection. 


Procedure 3. Method of undetermined coefficients 


To find a particular integral for the inhomogeneous linear
constant-coefficient second-order differential equation

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = f(x),$$

use a trial solution $y(x)$ with a form similar to that of $f(x)$. The
following table gives appropriate trial solutions for simple cases.


Target function $f(x)$                                  Trial solution $y(x)$

$m_n x^n + m_{n-1} x^{n-1} + \cdots + m_1 x + m_0$      $p_n x^n + p_{n-1} x^{n-1} + \cdots + p_1 x + p_0$

$me^{kx}$                                               $pe^{kx}$

$m\cos kx + n\sin kx$                                   $p\cos kx + q\sin kx$


The full trial solutions in the right-hand column should be used even
when some of the coefficients in $f(x)$ are missing (as in $m_2 x^2$ or
$m\cos kx$, for example).


To determine the coefficients in $y(x)$, differentiate it twice, substitute
into the left-hand side of the differential equation, and equate
coefficients of corresponding terms.


Exercise 17 


What form of trial solution y should you use in order to find a particular 
integral for each of the following differential equations? 
dy | dy 


d*y an 
(a) —S-y=e (b) qt + 2g, 7 fy = Sinde 


To round off this discussion of inhomogeneous differential equations, the 
following exercise gives a variety of equations for further practice. 


There are exceptional cases where these trial solutions do
not work; see Subsection 3.3.


In this question, you need not find the particular integrals.




See Exercise 6(a) and 
Exercise 3(c). 


See Exercise 3(c). 




Exercise 18 
Find the general solutions of the following differential equations. 


The complementary functions of the differential equations in parts (a)
and (c) are $C\cos 2t + D\sin 2t$ and $Ce^{-2x} + De^{2x}$, respectively.


(a) $\dfrac{d^2\theta}{dt^2} + 4\theta = 2t$

(b) $\dfrac{d^2y}{dx^2} + 4y = 10\sin 3x$

(c) $\dfrac{d^2y}{dx^2} - 4y = 15e^{-x}$


3.3 Exceptional cases 


There are some exceptional cases for which Procedure 3 fails. You will not 
encounter these exceptions in assignments or in the exam, but it is useful 
to have some idea of what can go wrong, and how a particular integral can 
be found under such circumstances. 


To take a definite case, consider the differential equation

$$\frac{d^2y}{dx^2} - 4y = 2e^{2x}. \qquad (48)$$

The associated homogeneous equation is

$$\frac{d^2y}{dx^2} - 4y = 0,$$

and this has general solution $y = Ce^{-2x} + De^{2x}$. The difficulty is now
apparent. Procedure 3 suggests that we use the trial solution $y = pe^{2x}$, but
this happens to be a solution of the associated homogeneous equation
(with $C = 0$, $D = p$), so substituting $y = pe^{2x}$ into the left-hand side of
equation (48) gives zero for any value of $p$, and this cannot be equal to the
non-zero right-hand side.


Difficulties like this are generally overcome by multiplying whichever trial
solution is suggested in Procedure 3 by the independent variable, $x$. So the
trial solution to use for equation (48) would be

$$y = pxe^{2x}.$$

Calculating the derivatives of this function, we get

$$\frac{dy}{dx} = pe^{2x} + 2pxe^{2x} = p(1 + 2x)e^{2x},$$

$$\frac{d^2y}{dx^2} = 2pe^{2x} + 2p(1 + 2x)e^{2x} = 4p(1 + x)e^{2x}.$$

Substituting these into the left-hand side of the differential equation gives

$$\frac{d^2y}{dx^2} - 4y = 4p(1 + x)e^{2x} - 4pxe^{2x} = 4pe^{2x}.$$

Therefore $y = pxe^{2x}$ is a solution of the differential equation provided that
$4pe^{2x} = 2e^{2x}$ for all $x$. Hence $p = \frac{1}{2}$, and

$$y_p = \tfrac{1}{2}xe^{2x}$$

is a particular integral for differential equation (48).
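The calculation above generalises. If $e^{kx}$ solves the associated homogeneous equation, so that $ak^2 + bk + c = 0$, then substituting $y = pxe^{kx}$ into $a y'' + b y' + c y = me^{kx}$ leaves $p(2ak + b)e^{kx} = me^{kx}$, giving $p = m/(2ak + b)$ provided that $2ak + b \neq 0$. This formula is not stated in the text; it is derived here only as an illustration, and the function names in the Python sketch below are assumptions. The sketch recovers $p = \frac{1}{2}$ for equation (48) and checks the residual using the derivatives calculated above.

```python
import math

def exceptional_exponential_coefficient(a, b, c, m, k):
    """Return p such that y = p*x*exp(k*x) satisfies
    a*y'' + b*y' + c*y = m*exp(k*x), when exp(k*x) solves the
    associated homogeneous equation."""
    if a * k**2 + b * k + c != 0:
        raise ValueError("not an exceptional case: use y = p*exp(k*x)")
    slope = 2 * a * k + b
    if slope == 0:
        # Equal roots: x*exp(k*x) also solves the homogeneous equation.
        raise ValueError("trial solution p*x*exp(k*x) does not work here")
    return m / slope

# Equation (48): y'' - 4y = 2 exp(2x).
p = exceptional_exponential_coefficient(a=1, b=0, c=-4, m=2, k=2)

def residual(x):
    """Left-hand side minus right-hand side of equation (48) for y = p*x*exp(2x)."""
    y = p * x * math.exp(2 * x)
    d2y = 4 * p * (1 + x) * math.exp(2 * x)
    return d2y - 4 * y - 2 * math.exp(2 * x)

print(p, [residual(x) for x in (0.0, 0.3, 1.0)])
```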


A similar technique works when the inhomogeneous term is a constant, as 
illustrated in the following example. 


Example 11 


The motion of a small ball bearing dropped into viscous oil can be
modelled by the differential equation

$$m\ddot{x} + r\dot{x} - mg = 0,$$

where $m$ is the mass of the ball, $r$ is a constant related to the viscosity of
the oil, $g$ is the magnitude of the acceleration due to gravity, and $x$ is the
vertical distance from the point of release.


(a) Find the general solution x(t) of this differential equation. 


(b) Use your answer to part (a) to show that the velocity of the ball 
approaches the limiting value mg/r as t becomes very large. 


Solution 
(a) The inhomogeneous equation is

$$m\ddot{x} + r\dot{x} = mg,$$

and the auxiliary equation for the associated homogeneous equation is

$$m\lambda^2 + r\lambda = 0.$$

This has solutions $\lambda = 0$ and $\lambda = -r/m$, so the complementary
function is

$$x_c = C + De^{-rt/m}.$$

The inhomogeneous term is the constant function $mg$, so Procedure 3
suggests a trial solution $x = p_0$. However, this is a solution of the
associated homogeneous equation (with $C = p_0$, $D = 0$). Hence we try
$x = p_0 t$ instead. Differentiating and substituting gives

$$rp_0 = mg,$$

so

$$p_0 = \frac{mg}{r}.$$

Hence a particular integral is

$$x_p = \frac{mgt}{r},$$

and the general solution is

$$x(t) = C + De^{-rt/m} + \frac{mgt}{r},$$

where $C$ and $D$ are arbitrary constants.




(b) The velocity is

$$\dot{x}(t) = -\frac{Dr}{m}e^{-rt/m} + \frac{mg}{r}.$$

This approaches the limiting value $mg/r$ as $t \to \infty$.

This answer can also be obtained directly from the differential
equation. If the velocity tends towards a constant value, the
acceleration $\ddot{x}$ approaches zero, and the differential equation itself tells
us that $\dot{x}$ tends to $mg/r$.
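The approach to the limiting velocity can be seen by evaluating the velocity formula at increasing times. In the Python sketch below, the numerical values of $m$, $r$, $g$ and $D$ are hypothetical and chosen purely for illustration (with $D = m^2 g/r^2$, which makes the velocity zero at $t = 0$); none of them comes from the example itself.

```python
import math

def velocity(t, m, r, g, D):
    """Velocity from Example 11(b): -(D*r/m)*exp(-r*t/m) + m*g/r."""
    return -(D * r / m) * math.exp(-r * t / m) + m * g / r

# Hypothetical values (SI units), assumed for illustration only.
m, r, g = 0.01, 0.5, 9.8
D = m**2 * g / r**2          # chosen so that the ball starts from rest
limiting_velocity = m * g / r

for t in [0.0, 0.02, 0.05, 0.1, 1.0]:
    print(t, velocity(t, m, r, g, D), limiting_velocity)
```

The printed velocities increase from zero and settle at $mg/r$, since the exponential term decays on a timescale of $m/r$.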


3.4 Combined cases 


Another situation that crops up occasionally is when the inhomogeneous
term is a linear combination of polynomial, sinusoidal and exponential
functions. You know what to do when the inhomogeneous term is $x + 1$,
and you also know what to do if it is $e^{3x}$, but what if the inhomogeneous
term is $2e^{3x} + 18(x + 1)$? The secret is to use the principle of
superposition to split the problem into smaller tasks. Again, we include
this topic for interest and completeness: assignments and the exam will
not contain questions on such combined cases.


Example 12 

Find a particular integral for

$$\frac{d^2y}{dx^2} + 9y = 2e^{3x} + 18x + 18. \qquad (49)$$

Solution

In Example 9 (Subsection 3.2), you saw that $y_p = \frac{1}{9}e^{3x}$ is a particular
integral for

$$\frac{d^2y}{dx^2} + 9y = 2e^{3x}.$$

In Example 7 (Subsection 3.1), you saw that $y_p = x + 1$ is a particular
integral for

$$\frac{d^2y}{dx^2} + 9y = 9x + 9.$$

Therefore, by the principle of superposition, a particular integral for
equation (49) is

$$y_p = \tfrac{1}{9}e^{3x} + 2(x + 1) = \tfrac{1}{9}e^{3x} + 2x + 2.$$




This approach can be used more generally. To find a particular integral for

$$a\frac{d^2y}{dx^2} + b\frac{dy}{dx} + cy = k_1 f_1(x) + k_2 f_2(x), \qquad (50)$$

where $k_1$ and $k_2$ are constants, we can split the task up: we find the
particular integral $g_1(x)$ that applies when just $f_1(x)$ is on the right-hand
side, and the particular integral $g_2(x)$ that applies when just $f_2(x)$ is on
the right-hand side. The principle of superposition then tells us that
$k_1 g_1(x) + k_2 g_2(x)$ is a particular integral for equation (50).
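As a numerical illustration of this result, the Python sketch below rebuilds the particular integral of Example 12 from its two parts, taking $f_1(x) = 2e^{3x}$ with $k_1 = 1$ and $f_2(x) = 9x + 9$ with $k_2 = 2$, and checks that it satisfies equation (49) at a few sample points. The function names are assumptions made for this example, and the second derivative is written out by hand.

```python
import math

def g1(x):
    """Particular integral for y'' + 9y = 2 exp(3x) (Example 9)."""
    return math.exp(3 * x) / 9

def g2(x):
    """Particular integral for y'' + 9y = 9x + 9 (Example 7)."""
    return x + 1

K1, K2 = 1.0, 2.0  # right-hand side of (49) is K1*2exp(3x) + K2*(9x + 9)

def combined(x):
    return K1 * g1(x) + K2 * g2(x)

def combined_second_derivative(x):
    # g1'' = exp(3x) and g2'' = 0.
    return K1 * math.exp(3 * x)

def residual(x):
    """Left-hand side minus right-hand side of equation (49)."""
    lhs = combined_second_derivative(x) + 9 * combined(x)
    rhs = 2 * math.exp(3 * x) + 18 * x + 18
    return lhs - rhs

print([residual(x) for x in (-1.0, 0.0, 0.5, 1.0)])
```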


Exercise 19 
Find a particular integral for the differential equation

$$2\frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 12\cos 2t + 10.$$

(Neither of the terms on the right-hand side satisfies the associated
homogeneous equation, so this is not an exceptional case.)


Exercise 20 


Find the general solutions of the following differential equations (starting 
from scratch with no complementary functions given). 


(a) u(t) + 4u’(t) + 5u(t) =5 (b) 3— - 2 —y=e 


4 Initial conditions and boundary 
conditions 


In Sections 2 and 3 you saw how to find the general solution of a 
homogeneous or inhomogeneous linear constant-coefficient second-order 
differential equation. In practice, however, we usually need to select a 
particular solution that satisfies certain additional conditions. This section 
explains how this is done. 


In Unit 2 you saw that the general solution of a first-order differential 
equation contains one arbitrary constant, and that a single additional 
condition (called an initial condition) is enough to fix the value of this 
constant and hence determine the particular solution. In the case of a 
second-order differential equation, the general solution contains two 
arbitrary constants, and two additional conditions are required. 


There are two types of additional conditions for second-order differential 
equations: initial conditions and boundary conditions. Problems involving 
such conditions are called initial-value problems and boundary-value 
problems, respectively, and are discussed in Subsections 4.1 and 4.2. 




Figure 9 Possible initial 
conditions for a pendulum 




4.1 Initial-value problems 


For a first-order differential equation, an initial condition is one that
specifies the value of the dependent variable ($y = a$, say) at a given value
of the independent variable ($t = t_0$); this is often written in the form
$y(t_0) = a$.


For a second-order differential equation, the initial conditions specify the
value of the dependent variable ($y = a$) and the value of its derivative
($dy/dt = b$), for the same given value of the independent variable ($t = t_0$),
and they are often written in the form $y(t_0) = a$, $y'(t_0) = b$.


Very often, $t_0$ represents the initial time when a system is released and we
are interested in the subsequent motion. But this is not essential; we could
equally well be interested in the prior motion that leads up to given values
of $x$ and $dx/dt$ at some final time $t_0$. Indeed, initial values may have
nothing to do with time at all if the independent variable represents some
other quantity such as position. The only essential point is that values of
the dependent variable and its derivative must both be given at the same
value of the independent variable.


Initial conditions arise naturally in many mechanical problems, where the 
initial values of the position x and velocity dx/dt are often specified. For 
example, we may know that a ball is thrown vertically upwards, at t = 0, 
from an initial position with an initial velocity. 


The pendulum in Figure 9 gives another example. When the string of the
pendulum makes its greatest angle $\theta_0$ with the vertical, the pendulum
changes its direction of swing and comes momentarily to rest. So if the
pendulum changes direction at $t = t_0$, we have the initial conditions $\theta = \theta_0$
and $d\theta/dt = 0$ when $t = t_0$.


Definitions 


• Initial conditions associated with a second-order differential
equation with dependent variable $y$ and independent variable $x$
specify that $y$ and $dy/dx$ take values $a$ and $b$, respectively, when
$x$ takes the value $x_0$. These conditions can be written as

\[ y = a \quad\text{and}\quad \frac{dy}{dx} = b \quad\text{when } x = x_0, \]

or as

\[ y(x_0) = a, \quad y'(x_0) = b. \]

The numbers $x_0$, $a$ and $b$ are often referred to as initial values.


• The combination of a second-order differential equation and
initial conditions is called an initial-value problem.


The following example shows how initial conditions can be used to find the 
two arbitrary constants in the general solution, and hence determine a 
particular solution of a second-order differential equation. 


4 Initial conditions and boundary conditions 


Example 13 
The general solution of the differential equation

\[ \frac{d^2y}{dx^2} - 3\frac{dy}{dx} + 2y = 0 \]

is

\[ y = Ce^{x} + De^{2x}, \tag{51} \]


where $C$ and $D$ are arbitrary constants (see Example 3). Find the
particular solution that satisfies the initial conditions $y = 0$ and
$dy/dx = 1$ when $x = 0$.


Solution 


One of the initial conditions involves dy/dx, so we take the derivative of 
the general solution (51), getting 


\[ \frac{dy}{dx} = Ce^{x} + 2De^{2x}. \tag{52} \]

The initial conditions state that $y(0) = 0$, $y'(0) = 1$. Substituting $x = 0$,
$y = 0$ into equation (51) gives

\[ 0 = Ce^{0} + De^{0} = C + D, \]

while substituting $x = 0$, $dy/dx = 1$ into equation (52) gives

\[ 1 = Ce^{0} + 2De^{0} = C + 2D. \]

Solving these equations gives $C = -1$, $D = 1$, so the required particular
solution is

\[ y = -e^{x} + e^{2x}. \]
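
The two simultaneous equations in Example 13 can also be solved mechanically. The following is a minimal Python sketch rather than part of the module's method: the helper name `solve_2x2` is hypothetical, and it simply applies Cramer's rule to the pair of equations obtained from the general solution (51) and its derivative (52).

```python
import math

def solve_2x2(a11, a12, a21, a22, b1, b2):
    """Solve a11*C + a12*D = b1 and a21*C + a22*D = b2 by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("the two conditions do not determine C and D uniquely")
    C = (b1 * a22 - a12 * b2) / det
    D = (a11 * b2 - a21 * b1) / det
    return C, D

# Initial conditions of Example 13: y(0) = 0 and y'(0) = 1.
x0, a, b = 0.0, 0.0, 1.0

# Row 1 comes from y  = C e^x +   D e^(2x)   (equation (51)),
# row 2 comes from y' = C e^x + 2 D e^(2x)   (equation (52)).
C, D = solve_2x2(math.exp(x0), math.exp(2 * x0),
                 math.exp(x0), 2 * math.exp(2 * x0),
                 a, b)
print(C, D)
```

Changing `x0`, `a` and `b` gives the constants for other initial conditions on the same differential equation.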


All the initial-value problems that you will meet in this module have 
unique solutions, so if you can find a solution that satisfies the initial 
conditions, then this is the solution. 


Exercise 21 
Solve the following initial-value problems. 


(a) The general solution of $u''(t) + 9u(t) = 0$ is $u = C\cos 3t + D\sin 3t$,
where $C$ and $D$ are arbitrary constants (see Exercise 4(b)). Find the
particular solution that satisfies the initial conditions $u(\pi/2) = 0$,
$u'(\pi/2) = 1$.

(b) The general solution of $u''(t) + 4u'(t) + 5u(t) = 5$ is
$u = e^{-2t}(C\cos t + D\sin t) + 1$, where $C$ and $D$ are arbitrary constants
(see Exercise 20(a)). Find the particular solution that satisfies the
initial conditions $u(0) = 3$, $u'(0) = 1$.


Note that when you are checking 
a particular solution, you should 
check that it satisfies the initial 
or boundary conditions as well 
as the differential equation. 




Figure 10 Finding the shape 
of a hanging chain with fixed 
ends 




4.2 Boundary-value problems 


In initial-value problems, the two conditions used to select a particular 
solution both refer to the same value of the independent variable. But this 
need not be the case. If the independent variable is $x$, then we could have
one condition for $x = x_1$ and another for $x = x_2$, say. Such conditions are
called boundary conditions. Later in the module (Unit 12) you will meet 
equations called partial differential equations which explain a vast range of 
phenomena; in many cases, these equations lead to second-order 
differential equations supplemented by boundary conditions. 


Boundary conditions arise, for example, in considering the shape of a chain 
of length $L$ that is suspended between two points, with heights $h_1$ and $h_2$
(see Figure 10). If $y(x)$ is the height of the chain at a horizontal distance $x$
from an origin, then the equation for $y(x)$ has to satisfy two boundary
conditions: $y(x_1) = h_1$ and $y(x_2) = h_2$. This pair of boundary conditions
gives the value of y at two different points. In other contexts, each 
boundary condition could specify the value of either y or dy/dx (or even a 
relationship between them). 


The conditions are called ‘boundary’ conditions because (as in the chain 
example) they often refer to conditions at the endpoints $x_1$ and $x_2$ of an
interval in which we want to explore the solutions of a differential 
equation. (This is not essential, however.) 


Definitions 


Consider a second-order differential equation with dependent 
variable y and independent variable x. 


• Boundary conditions associated with such an equation specify
the values of $y$ (or $dy/dx$, or some combination of $y$ and $dy/dx$)
at two different values of $x$. For example, they could specify that
$y(x_1) = y_1$ and $y(x_2) = y_2$. The numbers $x_1$, $x_2$, $y_1$ and $y_2$ are
often referred to as boundary values.


• The combination of a second-order differential equation and
boundary conditions is called a boundary-value problem.


The following example shows how boundary conditions can be used to find 
values for the two arbitrary constants that appear in the general solution of 
a second-order differential equation, and hence find a particular solution. 


Example 14 

The differential equation

\[ \frac{d^2y}{dx^2} + 9y = 0 \]

has general solution

\[ y = C\cos 3x + D\sin 3x, \tag{53} \]




where $C$ and $D$ are arbitrary constants (see Exercise 4(b)). Find the
particular solution that satisfies the boundary conditions $y = 0$ when $x = 0$
and $dy/dx = 1$ when $x = \pi/3$.


Solution 


One of the boundary conditions involves dy/dx, so we need the derivative 
of the general solution (53): 


\[ \frac{dy}{dx} = -3C\sin 3x + 3D\cos 3x. \tag{54} \]

The boundary conditions state that $y(0) = 0$, $y'(\pi/3) = 1$. Substituting
$x = 0$, $y = 0$ into equation (53) gives

\[ 0 = C\cos 0 + D\sin 0 = C, \]

so $C = 0$. Substituting $x = \pi/3$, $y' = 1$ and $C = 0$ into equation (54) gives

\[ 1 = 3D\cos\pi = -3D. \]

Therefore $C = 0$ and $D = -\tfrac{1}{3}$, so the required particular solution is

\[ y = -\tfrac{1}{3}\sin(3x). \]
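
A particular solution should be checked against both the differential equation and the additional conditions. The short Python sketch below does this numerically for the solution of Example 14; the function names and the sample points are illustrative choices, not part of the module text.

```python
import math

def y(x):
    """Particular solution of Example 14."""
    return -math.sin(3 * x) / 3

def dy(x):
    """First derivative of y."""
    return -math.cos(3 * x)

def d2y(x):
    """Second derivative of y."""
    return 3 * math.sin(3 * x)

# Boundary conditions: y(0) = 0 and y'(pi/3) = 1.
boundary_ok = abs(y(0.0)) < 1e-12 and abs(dy(math.pi / 3) - 1.0) < 1e-12

# Differential equation: y'' + 9y = 0, tested at a few arbitrary sample points.
equation_ok = all(abs(d2y(x) + 9 * y(x)) < 1e-12 for x in (0.0, 0.4, 1.0, 2.5))

print(boundary_ok, equation_ok)
```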


Unlike initial-value problems, some boundary-value problems may have no 
solutions even when the differential equation is linear and 
constant-coefficient, and has a continuous function on the right-hand side. 
The following example illustrates this point. 


Example 15 

The differential equation

\[ \frac{d^2y}{dx^2} + 4y = 0 \]

has general solution $y = C\cos 2x + D\sin 2x$, where $C$ and $D$ are arbitrary
constants (see Exercise 6(a)). Try to find a solution to the boundary-value
problem based on this differential equation and the boundary conditions

\[ y(0) = 0, \quad y(\pi/2) = 1. \]

Solution 


Substituting each of the boundary conditions into the general solution in 
turn gives 


\[ 0 = C\cos 0 + D\sin 0 = C, \]
\[ 1 = C\cos\pi + D\sin\pi = -C. \]


There is no solution for which C = 0 and C = —1, so there is no solution of 
the differential equation that satisfies the given boundary conditions. 
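
When both boundary conditions fix the value of $y$, the constants $C$ and $D$ satisfy a pair of linear equations, and the determinant of that pair shows whether a unique solution can exist. The sketch below illustrates this for $y = C\cos kx + D\sin kx$; the function name `bvp_determinant`, the tolerance and the second pair of endpoints are assumptions made for illustration.

```python
import math

def bvp_determinant(x1, x2, k=2.0):
    """Determinant of the linear system
         C cos(k x1) + D sin(k x1) = y1
         C cos(k x2) + D sin(k x2) = y2
    for the constants C and D in y = C cos(kx) + D sin(kx)."""
    return (math.cos(k * x1) * math.sin(k * x2)
            - math.sin(k * x1) * math.cos(k * x2))

TOL = 1e-12  # assumed tolerance for treating a floating-point value as zero

# Example 15: conditions imposed at x = 0 and x = pi/2.
det_example_15 = bvp_determinant(0.0, math.pi / 2)

# A hypothetical alternative: conditions imposed at x = 0 and x = pi/4.
det_other = bvp_determinant(0.0, math.pi / 4)

print(abs(det_example_15) < TOL)   # singular system
print(abs(det_other) < TOL)        # non-singular system
```

A zero determinant means that the conditions either contradict one another, as in Example 15, or leave one of the constants undetermined; a non-zero determinant gives exactly one pair $C$, $D$.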




An outcome of ‘no solution’ need not be unreasonable. If a physical system 
obeys a differential equation, and there is no solution consistent with a 
given set of boundary conditions, then those boundary conditions must be 
unrealistic. For example, the chain in Figure 10 cannot hang from points 
that are further apart than its fixed length. 


It is also possible for boundary-value problems to have solutions that are 
not unique, as the following important example illustrates. 


Example 16 


At a given moment in time, the displacement y of an oscillating guitar 
string at a distance x from one of its ends satisfies the differential equation 


\[ \frac{d^2y}{dx^2} + k^2 y = 0 \]

for a fixed constant $k$, which is proportional to the frequency of the
oscillation (i.e. the pitch of sound produced). The string is held fixed at its
ends, $x = 0$ and $x = L$, so that $y(x)$ satisfies the boundary conditions
$y(0) = 0$ and $y(L) = 0$. Find the possible solutions to this boundary-value
problem, and show that non-zero solutions can be found only if $k = n\pi/L$,
where $n$ is an integer.


Solution 

The general solution of the differential equation is 
\[ y(x) = A\sin(kx) + B\cos(kx), \]

where A and B are arbitrary constants. 


The boundary condition $y(0) = 0$ implies that $B = 0$, so $y(x) = A\sin(kx)$.
In order to satisfy the boundary condition $y(L) = 0$, we must have
$A\sin(kL) = 0$. This equation has the trivial solution $A = 0$, which
corresponds to $y(x) = 0$. For $A \neq 0$, it gives the condition $\sin(kL) = 0$,
which implies that $kL = n\pi$, where $n$ is an integer. Hence $k = n\pi/L$ and
we conclude that the displacement of the guitar string must be of the form

\[ y(x) = A\sin\left(\frac{n\pi x}{L}\right). \tag{55} \]

When considering the possible sinusoidal shapes adopted by an oscillating
guitar string, we can restrict $n$ to the positive integers $n = 1, 2, 3, \ldots$. This
is because $A\sin(-n\pi x/L) = -A\sin(n\pi x/L)$, so changing the sign of $n$ is
equivalent to changing the sign of $A$. The value $n = 0$ can also be omitted,
because it is equivalent to taking $A = 0$.
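
The allowed values $k = n\pi/L$ can be checked numerically. In the Python sketch below, the string length `L = 0.65` and the function name `mode_shape` are hypothetical choices made only for illustration; the code evaluates equation (55) at both ends of the string and then shows that a value of $k$ lying between two allowed values fails the condition $y(L) = 0$.

```python
import math

def mode_shape(n, x, L, A=1.0):
    """Displacement A sin(n pi x / L) of mode n at position x (equation (55))."""
    return A * math.sin(n * math.pi * x / L)

L = 0.65  # hypothetical string length; any positive value would do

for n in (1, 2, 3):
    k = n * math.pi / L
    # Both end values should vanish, up to floating-point rounding.
    print(n, k, mode_shape(n, 0.0, L), mode_shape(n, L, L))

# A value of k that is not an integer multiple of pi/L violates y(L) = 0:
k_bad = 1.5 * math.pi / L
print(math.sin(k_bad * L))
```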




Motions with different values of n are said to be different modes of 
oscillation of the string. The first three modes of oscillation are shown 
schematically in Figure 11; in each case, the largest excursions of the string 
from the equilibrium y = 0 position are indicated. 


(Three panels, labelled $n = 1$, $n = 2$ and $n = 3$, each showing $y(x)$ plotted against $x$.)


Figure 11 The first three modes of oscillation of a guitar string. These 
modes produce sounds of different pitches. 


The fact that there are many different solutions satisfying the boundary 
conditions should not alarm you. This just means that the boundary 
conditions restrict the possible solutions, but extra information (in the 
form of initial conditions telling us how the guitar string is released) is 
needed to determine the precise motion that arises in a given situation. 


Partial differential equations 


The boundary conditions allow us to identify the possible modes, but 
further information is needed to predict the actual motion of the 
string. You will see how this works in Unit 12, when we discuss a 
more general type of differential equation, called a partial differential 
equation. The process of solving a partial differential equation often 
leads to a boundary-value problem for a second-order differential 
equation, and this is the main reason why boundary-value problems 
are important in physics and engineering. 


An example arises in quantum mechanics, which can give rise to a 
differential equation very like that for a vibrating string. In this 
context, the fact that the constant k takes discrete values corresponds 
to the fact that the system has discrete energy levels, a phenomenon 
called the quantisation of energy. 




Exercise 22 
This question refers to the differential equation 
\[ u''(x) + 4u(x) = 0, \]

which has the general solution $u = C\cos 2x + D\sin 2x$, where $C$ and $D$ are
arbitrary constants (see Exercise 6(a)). 


Each part gives a set of additional conditions. State whether these are 
initial conditions or boundary conditions, and find the solution (or 
solutions) that satisfy both the differential equation and the additional 
conditions. 


(a) aO=1,070)=0 (b) a0)=0, a(4) =0 
(c) u(Z) =0, w’(F) =0 (d) u(—m) = 1, u(Z) =2 


5 Resonance 


The material in this section is non-assessable and will not be tested in
continuous assessment or in the exam. However, we strongly advise
students of physical science to study it, as these ideas will be used in
higher-level modules.


In Subsection 2.4, we used the physical example of a damped harmonic 
oscillator to gain insights into the solutions of homogeneous linear 
constant-coefficient second-order differential equations. This final section 
does something similar for inhomogeneous equations. 


Suppose that an object of mass m moves to and fro along the x-axis 
around an equilibrium position x = 0. The object has position x(t) at 
time $t$. It experiences a force $-kx$ due to a spring, and a damping force
$-\gamma\,dx/dt$ that opposes the motion of the object, where the constants $m$, $k$
and $\gamma$ are all positive. Up to this point, the system is just the damped
harmonic oscillator considered earlier. 


Now suppose that an additional time-dependent force f(t) is applied to the 
object. This force is due to some external agency, and does not depend on 
the object’s position or velocity. Then Newton’s second law tells us that 
\[ -kx - \gamma\frac{dx}{dt} + f(t) = m\frac{d^2x}{dt^2}, \]

which is equivalent to the inhomogeneous linear constant-coefficient
second-order differential equation

\[ m\frac{d^2x}{dt^2} + \gamma\frac{dx}{dt} + kx = f(t). \tag{56} \]



This system is known as a forced damped harmonic oscillator, and 
the external force is called the driving force. 


A very important case occurs when the externally applied driving force is a
sinusoidal function of time. We therefore take

\[ f(t) = F_0\sin(\omega t), \]

where $F_0$ and $\omega$ are positive constants. Under these circumstances,
equation (56) can be written in the form

\[ \frac{d^2x}{dt^2} + 2\Gamma\frac{dx}{dt} + \omega_0^2 x = a_0\sin(\omega t), \tag{57} \]

where $\Gamma = \gamma/2m$, $\omega_0 = \sqrt{k/m}$ and $a_0 = F_0/m$ are all positive constants.
Here, $\omega_0$ is the angular frequency of the simple harmonic oscillator (in the
absence of damping or external forces). We include the subscript zero to
distinguish $\omega_0$ from the angular frequency $\omega$ of the driving force.


According to Procedure 2, the general solution of equation (57) is the sum
of a complementary function $x_c(t)$ and a particular integral $x_p(t)$. The
complementary function is the general solution of the associated
homogeneous equation

\[ \frac{d^2x}{dt^2} + 2\Gamma\frac{dx}{dt} + \omega_0^2 x = 0, \]

and you saw in Subsection 2.4 that this general solution contains an
exponentially decaying factor $e^{-\Gamma t}$. Because of this factor, the
complementary function represents a short-lived or transient contribution
to the motion, which eventually dies away. Once this transient contribution
has become negligible, the system settles down to a steady-state motion,
given by the particular integral, which we focus on here.


This decay is a direct
consequence of the damping
force.


The required particular integral can be found using the methods of 
Subsection 3.2. We can substitute a trial solution 


\[ x(t) = p\cos(\omega t) + q\sin(\omega t) \]

into differential equation (57), and find $p$ and $q$ by matching the
coefficients of $\cos(\omega t)$ and of $\sin(\omega t)$ on both sides of the equation. The
method is identical to that used in Example 10.


The algebra gets a little messy, so we now introduce an alternative
approach, which is more efficient in this case. The key ingredient is Euler's
formula, which tells us that

\[ a_0 e^{i\omega t} = a_0\cos(\omega t) + i a_0\sin(\omega t), \]

the imaginary part of which is equal to the right-hand side of
equation (57). We therefore consider a differential equation that is closely
related to equation (57), namely

\[ \frac{d^2z}{dt^2} + 2\Gamma\frac{dz}{dt} + \omega_0^2 z = a_0 e^{i\omega t}. \tag{58} \]

Because the right-hand side of this equation is complex, the function $z(t)$
in this equation must also be complex.




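Substituting $z(t) = u(t) + iv(t)$, with $u$ and $v$ real, into equation (58), and
equating real parts and imaginary parts on both sides, we get

\[ \frac{d^2u}{dt^2} + 2\Gamma\frac{du}{dt} + \omega_0^2 u = a_0\cos(\omega t), \]

\[ \frac{d^2v}{dt^2} + 2\Gamma\frac{dv}{dt} + \omega_0^2 v = a_0\sin(\omega t). \]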


It follows that v(t), which is the imaginary part of z(t), satisfies the main 
equation of interest, equation (57). Our tactic will therefore be to solve 
equation (58) for z(t) and then take its imaginary part; this is an effective 
shortcut because equation (58) is easier to solve than equation (57). 


Substituting the trial solution $z = pe^{i\omega t}$ into equation (58), we get

\[ \left[(i\omega)^2 + 2\Gamma(i\omega) + \omega_0^2\right] p e^{i\omega t} = a_0 e^{i\omega t}, \]

from which we obtain the particular integral

\[ z(t) = a_0\,\frac{e^{i\omega t}}{\omega_0^2 - \omega^2 + 2i\Gamma\omega}. \]

Multiplying top and bottom by $\omega_0^2 - \omega^2 - 2i\Gamma\omega$ and using Euler's formula,
this can be written as

\[ z(t) = a_0\,\frac{[\cos(\omega t) + i\sin(\omega t)]\,[(\omega_0^2 - \omega^2) - 2i\Gamma\omega]}{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}. \]

The solution that we need is the imaginary part of this expression.
Multiplying out the brackets and picking out the coefficient of $i$, we
conclude that equation (57) has the particular integral

\[ x(t) = a_0\,\frac{(\omega_0^2 - \omega^2)\sin(\omega t) - 2\Gamma\omega\cos(\omega t)}{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}, \tag{59} \]

which is of the expected form

\[ x(t) = p\cos(\omega t) + q\sin(\omega t), \]

where

\[ p = \frac{-2\Gamma\omega\,a_0}{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}, \qquad q = \frac{(\omega_0^2 - \omega^2)\,a_0}{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}. \]

The interpretation of this expression is clarified by writing it in the form

\[ x(t) = A\sin(\omega t + \phi), \tag{60} \]

where $A$ is the amplitude and $\phi$ is the phase constant. These can be found
by using the argument that led to equations (31) and (32) in
Subsection 2.4. We have $A = \sqrt{p^2 + q^2}$ and $\tan\phi = p/q$, so

\[ A = \frac{a_0}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}} \tag{61} \]

and

\[ \tan\phi = \frac{-2\Gamma\omega}{\omega_0^2 - \omega^2}. \tag{62} \]
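
The complex method lends itself to a direct numerical check. In the sketch below, which is an illustration rather than the module's own procedure, the complex coefficient of $e^{i\omega t}$ is computed with Python's built-in complex numbers; its modulus is the amplitude $A$ and its argument is the phase constant $\phi$, because the imaginary part of $pe^{i\omega t}$ is $|p|\sin(\omega t + \arg p)$. The function names and the parameter values are assumptions chosen only for illustration.

```python
import cmath
import math

def steady_state(omega, omega0, Gamma, a0):
    """Return (A, phi) for the steady-state motion x(t) = A sin(omega t + phi).

    The coefficient p in z = p exp(i omega t) is
    a0 / (omega0^2 - omega^2 + 2 i Gamma omega), so A = |p| and phi = arg(p)."""
    p = a0 / complex(omega0**2 - omega**2, 2 * Gamma * omega)
    return abs(p), cmath.phase(p)

def amplitude_61(omega, omega0, Gamma, a0):
    """Amplitude computed directly from equation (61)."""
    return a0 / math.sqrt((omega0**2 - omega**2)**2 + 4 * Gamma**2 * omega**2)

# Hypothetical parameter values, chosen only for illustration.
omega0, Gamma, a0 = 1.0, 0.1, 1.0

for omega in (0.5, 1.0, 2.0):
    A, phi = steady_state(omega, omega0, Gamma, a0)
    # The two amplitudes should agree; phi is negative, indicating a lag.
    print(omega, A, amplitude_61(omega, omega0, Gamma, a0), phi)
```

Because `cmath.phase` returns the argument in the range from $-\pi$ to $\pi$, the phase constant obtained this way varies continuously from 0 towards $-\pi$ as $\omega$ increases.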


Equation (60) gives the response of the system to the driving force once 
the transient motion associated with the complementary function has died 
away. Not surprisingly, the system performs a sinusoidal oscillation with 
the same angular frequency $\omega$ as the driving force. However, the
amplitude $A$ and phase constant $\phi$ behave in interesting ways. We focus on
cases where the oscillator is underdamped ($\Gamma < \omega_0$).


Figure 12(a) shows the amplitude $A$ as a function of the driving angular
frequency $\omega$ for four different values of the damping parameter $\Gamma$. In each
case, the amplitude is greatest when $\omega$ is close to the natural angular
frequency $\omega_0$ of the corresponding undriven undamped oscillator (equation (61)
puts the maximum at $\omega^2 = \omega_0^2 - 2\Gamma^2$, which is very near $\omega_0^2$ when the
damping is light). This effect appears as a peak in the graph, which becomes
higher and narrower as the damping parameter $\Gamma$ is reduced. This phenomenon
is called resonance, and the angular frequency $\omega = \omega_0$, at or very near which
the response is greatest, is called the resonant angular frequency.
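
The position and height of the resonance peak can be explored numerically from equation (61). The sketch below uses a simple grid search with $\omega_0 = 1$ and the four damping values quoted for Figure 12; the function names and the grid resolution are assumptions. For comparison it also prints $\sqrt{\omega_0^2 - 2\Gamma^2}$, the value of $\omega$ that minimises the denominator of equation (61), so that the small shift of the peak away from $\omega_0$ can be seen when the damping is not light.

```python
import math

def amplitude(omega, omega0, Gamma, a0=1.0):
    """Steady-state amplitude from equation (61)."""
    return a0 / math.sqrt((omega0**2 - omega**2)**2 + 4 * Gamma**2 * omega**2)

def peak_frequency(omega0, Gamma, steps=20000):
    """Grid search for the omega in (0, 2*omega0] that maximises the amplitude."""
    grid = [2.0 * omega0 * i / steps for i in range(1, steps + 1)]
    return max(grid, key=lambda w: amplitude(w, omega0, Gamma))

omega0 = 1.0  # natural angular frequency used in Figure 12

for Gamma in (0.50, 0.25, 0.10, 0.05):
    w_peak = peak_frequency(omega0, Gamma)
    w_min_denominator = math.sqrt(omega0**2 - 2 * Gamma**2)
    print(Gamma, w_peak, w_min_denominator, amplitude(w_peak, omega0, Gamma))
```

The printed peak heights grow as $\Gamma$ is reduced, and the peak position approaches $\omega_0$, in line with the description of Figure 12(a).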




Figure 12 The response of a forced underdamped harmonic oscillator
with $\omega_0 = 1$ as a function of the driving angular frequency $\omega$:
(a) the amplitude $A$ for $\Gamma = 0.50$, $\Gamma = 0.25$, $\Gamma = 0.10$ and $\Gamma = 0.05$;
(b) the corresponding phase constant $\phi$


The phase constant $\phi$ is plotted in Figure 12(b). In general, the oscillation
of the responding system lags behind the driving force, so $\phi$ is negative.
The phase constant does not depend on the amplitude of the driving force,
but it does depend on its angular frequency. At low angular frequencies,
the lag is small. At the resonant angular frequency, the lag is a quarter of
a cycle, and this grows to half a cycle at frequencies that are much larger
than $\omega_0$.


Exercise 23 


Suppose that the displacement $x(t)$ of a forced damped harmonic oscillator
obeys the inhomogeneous equation

\[ \frac{d^2x}{dt^2} + 2\Gamma\frac{dx}{dt} + \omega_0^2 x = a_0\cos(\omega t), \]

where $\Gamma$, $\omega_0$ and $a_0$ are positive constants. Use the method based on
equation (58) to find a particular integral for this equation, and hence
obtain an expression for the amplitude as a function of $\omega$.




The resonance behaviour
illustrated in Figure 12 appears
only if the oscillator is
underdamped. A forced
overdamped oscillator still
performs sinusoidal oscillations,
but its amplitude always
decreases as $\omega$ increases.


In plotting the graph in
Figure 12(b), it was assumed
that $\phi$ is a continuous function
of $\omega$. The arctan function was
therefore allowed to extend
beyond the range from $-\pi/2$ to
$\pi/2$.




Exercise 24 


Use equation (61) to obtain simple formulas for the amplitude of 
oscillation of a forced underdamped oscillator in the following cases. 


Express all your answers as simple fractions (involving $a_0$, $\omega_0$, $\Gamma$ and $\omega$ as
necessary). You will need to make suitable approximations in the first two
cases.


(a) $\omega \ll \omega_0$ (b) $\omega \gg \omega_0$ (c) $\omega = \omega_0$


Examples of resonance effects 


Resonance occurs when an oscillation is sustained by an external 
influence, and the amplitude of the oscillation peaks strongly at a 
particular angular frequency of the external influence. 


A familiar example occurs when you push a swing. Swings are 
designed to have little friction, so $\Gamma$ is small and there is a
pronounced resonance effect. You can get a motion of large amplitude 
if you push at the same angular frequency as the natural angular 
frequency of the swing. 


Simple radio receivers (Figure 13) use the phenomenon of resonance
to select which radio station to receive. Every radio station works at
a different ‘carrier’ frequency. A circuit inside the receiver obeys the
same equation as the damped harmonic oscillator, with electrical
current playing the role of displacement $x(t)$. Tuning the radio
receiver changes the natural frequency of this circuit, bringing it into
resonance with the frequencies of various radio stations. In a radio
receiver, $\Gamma$ is very small, so the receiver responds to only a very
narrow band of frequencies, allowing a single station to be selected.


Figure 13 A simple radio
receiver, which uses the
phenomenon of resonance to
select which transmission to
receive


The image shown in Figure 14 is from a magnetic resonance imaging 
(MRI) machine. In this case, the oscillating quantity is provided by 
magnetic properties of atomic nuclei, and the resonant frequency 
depends on the strength of an applied magnetic field. Images are 
obtained by placing the patient in a strong non-uniform magnetic 
field, so that different parts of the body are characterised by different 
resonant frequencies. 


Figure 14 A false-colour
MRI image of a brain of a
healthy volunteer,
emphasising the veins on top
and the basal ganglia beneath


Finally, when a light wave (or other electromagnetic wave) is shone on
a material, it causes the electrons in the material to oscillate to and
fro. In a classical model, the electrons behave like forced damped
harmonic oscillators, with the driving force supplied by the light
wave. Energy is transferred from the light wave to the oscillating
electrons, so some of the light is absorbed. We can therefore use the
equations of forced damped harmonic motion to model the absorption
of light in materials.




The absorption of light by single atoms is best described by quantum 
mechanics. Rather remarkably, however, resonance peaks in 
absorption are still predicted. Figure 15 is an image of the spectrum 
of sunlight. Among the dark lines there are lines due to absorption of 
light by helium in the solar atmosphere. These lines are found where 
the frequency of vibration of the sunlight is in resonance with 
frequencies of helium atoms. This approach demonstrated the 
existence of helium many years before it was isolated on the Earth. 




Figure 15 Dark lines in the spectrum of sunlight, due to a 
resonance with atoms in the atmosphere of the Sun, which 
demonstrates the existence of helium 


Learning outcomes 


After studying this unit, you should be able to do the following. 


Understand and use the terminology relating to linear 
constant-coefficient second-order differential equations. 


Understand the key role of the principle of superposition in the 
solution of linear constant-coefficient second-order differential 
equations. 


Obtain the general solution of a homogeneous linear 
constant-coefficient second-order differential equation using the 
solutions of its auxiliary equation. 


Use the method of undetermined coefficients to find a particular 
integral for an inhomogeneous linear constant-coefficient second-order 
differential equation with certain simple forms of right-hand-side 
function. 


Obtain the general solution of an inhomogeneous linear 
constant-coefficient second-order differential equation by combining its 
complementary function with a particular integral. 


Use the general solution together with a pair of initial or boundary 
conditions to obtain, when possible, a particular solution of a linear 
constant-coefficient second-order differential equation. 




Recall that $e^0 = \exp(0) = 1$.




Solutions to exercises 


Solution to Exercise 1 


(a) In equations (i) to (vi), the (dependent, independent) variable pairs are
all $(y, x)$. In equations (vii), (viii) and (ix) they are $(t, \theta)$, $(x, t)$ and
$(x, t)$, respectively.

(b) Equations (i), (ii), (iii), (iv), (vii), (viii) and (ix) are linear.
(Equations (v) and (vi) are non-linear.)

(c) All of the linear equations are constant-coefficient except for (iv). So
the linear constant-coefficient equations are (i), (ii), (iii), (vii), (viii)
and (ix).

(d) Of the linear constant-coefficient equations, only (iii) and (ix) are
homogeneous.


Solution to Exercise 2 


(a) $\lambda^2 - 5\lambda + 6 = 0$
(b) $\lambda^2 - 9 = 0$
(c) $\lambda^2 + 2\lambda = 0$


Solution to Exercise 3 


(a) The auxiliary equation is $\lambda^2 + 5\lambda + 6 = 0$. This can be factorised as
$(\lambda + 2)(\lambda + 3) = 0$, giving the roots $\lambda_1 = -2$ and $\lambda_2 = -3$. The
general solution is therefore

\[ y = Ce^{-2x} + De^{-3x}, \]

where $C$ and $D$ are arbitrary constants.

(b) The auxiliary equation is $2\lambda^2 + 3\lambda = 0$. This can be factorised as
$\lambda(2\lambda + 3) = 0$, so its roots are $\lambda_1 = 0$ and $\lambda_2 = -\tfrac{3}{2}$. The general
solution is therefore

\[ y = Ce^{0} + De^{-3x/2} = C + De^{-3x/2}, \]

where $C$ and $D$ are arbitrary constants.

(c) The auxiliary equation is $\lambda^2 - 4 = 0$, i.e. $\lambda^2 = 4$, so its roots are
$\lambda_1 = -2$ and $\lambda_2 = 2$. The general solution is therefore


e=0e™ + De, 


where C and D are arbitrary constants. (This differential equation is 
a special case of equation (14) discussed in Subsection 2.1.) 




Solution to Exercise 4 


(a) The auxiliary equation is $\lambda^2 + 4\lambda + 8 = 0$, which has solutions

\[ \lambda = \frac{-4 \pm \sqrt{16 - 32}}{2} = -2 \pm 2i. \]

The general solution is therefore

\[ y = e^{-2x}(C\cos 2x + D\sin 2x). \]

(b) The auxiliary equation is $\lambda^2 + 9 = 0$, which has solutions

\[ \lambda = \pm 3i. \]

These are of the form $\alpha \pm \beta i$, where $\alpha = 0$ and $\beta = 3$.
The general solution is therefore

\[ \theta = e^{0}(C\cos 3t + D\sin 3t) = C\cos 3t + D\sin 3t. \]

Of course, this is one of the simple cases discussed at the beginning of
this section: it corresponds to simple harmonic motion.

Solution to Exercise 5 


(a) The auxiliary equation is $\lambda^2 + 2\lambda + 1 = 0$, which can be factorised as
$(\lambda + 1)^2 = 0$, giving equal roots $\lambda_1 = \lambda_2 = -1$. The general solution is
therefore

\[ y = (C + Dx)e^{-x}, \]

where $C$ and $D$ are arbitrary constants.

(b) The auxiliary equation is $\lambda^2 - 4\lambda + 4 = 0$, which can be factorised as
$(\lambda - 2)^2 = 0$, giving equal roots $\lambda_1 = \lambda_2 = 2$. The general solution is
therefore

\[ s = (C + Dt)e^{2t}, \]

where $C$ and $D$ are arbitrary constants.


Solution to Exercise 6 


(a) The auxiliary equation is $\lambda^2 + 4 = 0$, which has solutions $\lambda = \pm 2i$.
The general solution is therefore

\[ y = C\cos 2x + D\sin 2x. \]

(You could also have written down this general solution directly from
equation (18).)


In all cases, C and D are
arbitrary constants.


(b) The auxiliary equation is $\lambda^2 - 9 = 0$, which has solutions $\lambda = \pm 3$. The
general solution is therefore

\[ y = Ce^{3x} + De^{-3x}. \]

(You could also have written down this general solution directly from
equation (15).)

(c) The auxiliary equation is $\lambda^2 + 2\lambda = 0$, which has solutions $\lambda_1 = 0$ and
$\lambda_2 = -2$. The general solution is therefore

\[ y = C + De^{-2x}. \]


(d) The auxiliary equation is $\lambda^2 - 2\lambda + 1 = 0$, which has solutions
$\lambda_1 = \lambda_2 = 1$. The general solution is therefore

\[ y = (C + Dx)e^{x}. \]

(e) The auxiliary equation is $\lambda^2 + 4\lambda + 29 = 0$, which has solutions

\[ \lambda = \frac{-4 \pm \sqrt{16 - 116}}{2} = -2 \pm 5i. \]

The general solution is therefore

\[ y = e^{-2x}(C\cos 5x + D\sin 5x). \]

(f) The auxiliary equation is $\lambda^2 - 6\lambda + 8 = 0$, which factorises as
$(\lambda - 4)(\lambda - 2) = 0$ and has solutions $\lambda_1 = 4$ and $\lambda_2 = 2$. The general
solution is therefore


u= Ce” + De. 


Solution to Exercise 7 
(a) The auxiliary equation is

\[ 3\lambda - 1 - 2\lambda^2 = 0, \]

or equivalently,

\[ 2\lambda^2 - 3\lambda + 1 = 0, \]

which factorises as

\[ (2\lambda - 1)(\lambda - 1) = 0. \]

(b) The two solutions of the auxiliary equation are $\lambda_1 = \tfrac{1}{2}$ and $\lambda_2 = 1$, so
the general solution of the differential equation is

\[ y = Ce^{x/2} + De^{x}, \]

where $C$ and $D$ are arbitrary constants.


Solution to Exercise 8 
(a) The auxiliary equation is

\[ \lambda^2 + 2\lambda + 2 = 0. \]

This has solutions $\lambda_1 = -1 + i$ and $\lambda_2 = -1 - i$, so the general
solution is

\[ y = e^{-x}(C\cos x + D\sin x). \]

(b) The auxiliary equation is

\[ \lambda^2 - 16 = 0. \]

This has solutions $\lambda_1 = 4$ and $\lambda_2 = -4$, so the general solution is

\[ y = Ce^{4x} + De^{-4x}. \]

(c) The auxiliary equation is

\[ \lambda^2 - 4\lambda + 4 = 0. \]

This has solutions $\lambda_1 = \lambda_2 = 2$, so the general solution is

\[ y = (C + Dx)e^{2x}. \]

(d) The auxiliary equation is

\[ \lambda^2 + 3\lambda = 0. \]

This has solutions $\lambda_1 = 0$ and $\lambda_2 = -3$, so the general solution is

\[ \theta = C + De^{-3t}. \]


Solution to Exercise 9 


The auxiliary equation $\lambda^2 + 4k\lambda + 4 = 0$ can be solved using the formula
method to give $\lambda = -2k \pm 2\sqrt{k^2 - 1}$. So there are complex conjugate
solutions, leading to a general solution involving sines and cosines, when
$k^2 < 1$, i.e. when $-1 < k < 1$.


Solution to Exercise 10 


(a) The differential equation can be written in the standard form for a damped
oscillator, with damping parameter

\[ \Gamma = \frac{\gamma}{2m} = \frac{0.016}{2 \times 0.80} = 0.01, \]

and the angular frequency of the corresponding undamped oscillator is

\[ \omega = \sqrt{\frac{9.8}{2.0}} = 2.21. \]

We therefore have $\Gamma < \omega$, and the oscillation is underdamped.

(b) The angular frequency of the damped pendulum is

\[ \Omega = \sqrt{\omega^2 - \Gamma^2} = \sqrt{2.21^2 - 0.01^2} = 2.21. \]

This is typical: the angular frequency is not very sensitive to damping
when $\Gamma \ll \omega$.

We therefore have

\[ \frac{2\pi}{\Omega} = 2.84, \]

so the period of oscillation is 2.8 seconds.

(c) The amplitude is given by $Ae^{-\Gamma t}$, where $A$ is a constant. If the
amplitude is 0.2 radians at time zero, the amplitude at time $t = 100$ is

\[ Ae^{-\Gamma t} = 0.20\,e^{-0.01 \times 100} = 0.2\,e^{-1} = 0.074 \text{ (in radians)}. \]
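
The arithmetic in this solution is easy to reproduce. The Python sketch below recomputes each quantity from the numbers quoted above; the variable names are illustrative, and reading the ratio 9.8/2.0 as gravitational acceleration divided by pendulum length is an assumption, since only the numbers appear in the solution.

```python
import math

gamma, m = 0.016, 0.80   # values in the quotient defining Gamma
g_over_l = 9.8 / 2.0     # ratio under the square root (interpretation as g/l is assumed)

Gamma = gamma / (2 * m)                       # damping parameter
omega = math.sqrt(g_over_l)                   # undamped angular frequency
Omega = math.sqrt(omega**2 - Gamma**2)        # damped angular frequency
period = 2 * math.pi / Omega                  # period of oscillation
amplitude_100 = 0.20 * math.exp(-Gamma * 100) # amplitude at t = 100

print(round(Gamma, 3), round(omega, 2), round(Omega, 2),
      round(period, 2), round(amplitude_100, 3))
```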


Solution to Exercise 11 


The condition for critical damping is $\Gamma = \omega = \sqrt{k/m}$, so in this case
$\Gamma = \sqrt{4} = 2$. The general solution is therefore

\[ x(t) = (C + Dt)e^{-2t}. \]




Solution to Exercise 12 


(a) The associated homogeneous equation is

\[ \frac{d^2y}{dx^2} + 4y = 0. \]

The complementary function (see Exercise 6(a)) is

\[ y_c = C\cos 2x + D\sin 2x. \]

Trying a solution of the form $y_p = p$, where $p$ is a constant, in the
original equation $d^2y/dx^2 + 4y = 8$ gives $0 + 4p = 8$, so $p = 2$. Thus a
particular integral is

\[ y_p = 2. \]

By Procedure 2, the general solution is

\[ y = C\cos 2x + D\sin 2x + 2. \]

(b) The associated homogeneous equation is

\[ \frac{d^2y}{dx^2} - 3\frac{dy}{dx} + 2y = 0. \]

The complementary function (see Example 3) is

\[ y_c = Ce^{x} + De^{2x}. \]

Trying a solution of the form $y_p = p$ in the original equation
$d^2y/dx^2 - 3\,dy/dx + 2y = 6$ gives $0 - 0 + 2p = 6$, so $p = 3$. Thus a
particular integral is

\[ y_p = 3. \]

By Procedure 2, the general solution is

\[ y = Ce^{x} + De^{2x} + 3. \]


Solution to Exercise 13 


(a) Substituting $y = p_1 x + p_0$ and its derivatives into the differential
equation gives

\[ 0 - 2p_1 + 2(p_1 x + p_0) = 2p_1 x + (2p_0 - 2p_1) = 2x + 3. \]

Equating the coefficients of $x$ gives $p_1 = 1$, and equating the constant
terms gives $p_0 = \tfrac{5}{2}$. Therefore a particular integral is

\[ y_p = x + \tfrac{5}{2}. \]

(b) Substituting $y = p_1 x + p_0$ and its derivatives into the differential
equation gives

\[ 0 + 2p_1 + (p_1 x + p_0) = p_1 x + (2p_1 + p_0) = 2x. \]

Equating the coefficients of $x$ gives $p_1 = 2$, and equating the constant
terms gives $p_0 = -4$, so a particular integral is

\[ y_p = 2x - 4. \]




Solution to Exercise 14 


We try $y = p_2 t^2 + p_1 t + p_0$, which has derivatives $y' = 2p_2 t + p_1$ and
$y'' = 2p_2$. Substituting these into the differential equation gives

\[ 2p_2 - (p_2 t^2 + p_1 t + p_0) = -p_2 t^2 - p_1 t + (2p_2 - p_0) = t^2. \]

Hence, separately equating coefficients of $t^2$, $t$ and 1, we get $p_2 = -1$,
$p_1 = 0$, $p_0 = -2$, so a particular integral is

\[ y_p = -t^2 - 2. \]


Solution to Exercise 15 


We try a solution of the form $y = pe^{-x}$, which has derivatives
$dy/dx = -pe^{-x}$ and $d^2y/dx^2 = pe^{-x}$. Substituting these into the
differential equation gives

\[ 2pe^{-x} + 2pe^{-x} + pe^{-x} = 5pe^{-x} = 2e^{-x}. \]

Hence $p = \tfrac{2}{5}$, and a particular integral is

\[ y_p = \tfrac{2}{5}e^{-x}. \]


Solution to Exercise 16 
We try $y = p\cos 3t + q\sin 3t$, which has derivatives

\[ \frac{dy}{dt} = -3p\sin 3t + 3q\cos 3t, \qquad \frac{d^2y}{dt^2} = -9p\cos 3t - 9q\sin 3t. \]

Substituting into the differential equation gives

\[ \begin{aligned}
(-9p\cos 3t - 9q\sin 3t) - (-3p\sin 3t + 3q\cos 3t)
&= -(9p + 3q)\cos 3t + (3p - 9q)\sin 3t \\
&= \cos 3t + \sin 3t.
\end{aligned} \]

Hence we have a pair of simultaneous equations

\[ -9p - 3q = 1, \]
\[ 3p - 9q = 1. \]

Adding three times the second equation to the first shows that
$q = -\tfrac{4}{30} = -\tfrac{2}{15}$, hence $p = -\tfrac{1}{15}$. A particular integral is thus

\[ y_p = -\tfrac{1}{15}\cos 3t - \tfrac{2}{15}\sin 3t. \]
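
The pair of simultaneous equations for $p$ and $q$ can be checked with exact rational arithmetic. The sketch below is one possible check, using Cramer's rule and Python's standard `fractions` module; the variable names are illustrative.

```python
from fractions import Fraction

# Coefficients of the simultaneous equations
#   -9p - 3q = 1
#    3p - 9q = 1
a11, a12, b1 = Fraction(-9), Fraction(-3), Fraction(1)
a21, a22, b2 = Fraction(3), Fraction(-9), Fraction(1)

det = a11 * a22 - a12 * a21        # non-zero, so the solution is unique
p = (b1 * a22 - a12 * b2) / det    # Cramer's rule for p
q = (a11 * b2 - a21 * b1) / det    # Cramer's rule for q

print(p, q)
```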


Solution to Exercise 17 


(a) Try y = pe**. 
(b) Try $y = p\cos 3x + q\sin 3x$.




Solution to Exercise 18 
(a) The complementary function is

$$\theta_c = C\cos 2t + D\sin 2t.$$

To find a particular integral, try $\theta = p_1 t + p_0$. Substituting this and
its derivatives into the differential equation gives

$$4(p_1 t + p_0) = 2t.$$

Hence $p_1 = \tfrac{1}{2}$, $p_0 = 0$, and a particular integral is

$$\theta_p = \tfrac{1}{2}t.$$

Therefore the general solution is

$$\theta = C\cos 2t + D\sin 2t + \tfrac{1}{2}t,$$

where $C$ and $D$ are arbitrary constants.

(b) The complementary function is

$$y_c = C\cos 2x + D\sin 2x.$$

The right-hand-side function is $10\sin 3x$, so we try
$y = p\cos 3x + q\sin 3x$. The derivatives are

$$\frac{dy}{dx} = -3p\sin 3x + 3q\cos 3x, \qquad \frac{d^2y}{dx^2} = -9p\cos 3x - 9q\sin 3x.$$

Substituting into the differential equation gives

$$(-9p\cos 3x - 9q\sin 3x) + 4(p\cos 3x + q\sin 3x) = -5p\cos 3x - 5q\sin 3x = 10\sin 3x,$$

so $p = 0$ and $q = -2$, and a particular integral is

$$y_p = -2\sin 3x.$$

Therefore the general solution is

$$y = C\cos 2x + D\sin 2x - 2\sin 3x,$$

where $C$ and $D$ are arbitrary constants.

(c) The complementary function is

$$y_c = Ce^{2x} + De^{-2x}.$$

The right-hand-side function is $15e^{-x}$, so we try $y = pe^{-x}$. The
derivatives are

$$\frac{dy}{dx} = -pe^{-x}, \qquad \frac{d^2y}{dx^2} = pe^{-x}.$$

Substituting for $d^2y/dx^2$ and $y$ into the differential equation gives

$$pe^{-x} - 4pe^{-x} = -3pe^{-x} = 15e^{-x},$$

so $p = -5$, and a particular integral is

$$y_p = -5e^{-x}.$$

Therefore the general solution is

$$y = Ce^{2x} + De^{-2x} - 5e^{-x},$$

where $C$ and $D$ are arbitrary constants.


Solution to Exercise 19 


We split the task of finding a particular integral into two parts, by first
finding particular integrals for

$$2\frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 12\cos 2t \tag{63}$$

and

$$2\frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 10. \tag{64}$$

In equation (63), the term $12\cos 2t$ on the right-hand side suggests the
trial solution $x = p\cos 2t + q\sin 2t$. This has derivatives

$$\frac{dx}{dt} = -2p\sin 2t + 2q\cos 2t, \qquad \frac{d^2x}{dt^2} = -4p\cos 2t - 4q\sin 2t.$$

Substituting into the left-hand side of the differential equation gives

$$2(-4p\cos 2t - 4q\sin 2t) + 3(-2p\sin 2t + 2q\cos 2t) + 2(p\cos 2t + q\sin 2t) = 6(q - p)\cos 2t - 6(p + q)\sin 2t.$$

Equating this to $12\cos 2t$ from the right-hand side of equation (63) gives
$p + q = 0$, $q - p = 2$. Hence $p = -1$, $q = 1$, and a particular integral is

$$x_p = -\cos 2t + \sin 2t.$$

Now consider equation (64). In this case we try a constant function
$x = p_0$. Substituting into the differential equation gives $2p_0 = 10$, so
$p_0 = 5$, and a particular integral is

$$x_p = 5.$$

Therefore, using the principle of superposition, a particular integral for the
original differential equation is

$$x_p = -\cos 2t + \sin 2t + 5.$$
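
The superposition result can be checked numerically by substituting the combined particular integral into the full equation, here taken to have right-hand side $12\cos 2t + 10$ (the sum of the right-hand sides of equations (63) and (64)). The following is a minimal sketch in Python; the function names, the step size `h` and the sample points are assumptions chosen for illustration, and the derivatives are approximated by central differences rather than computed exactly.

```python
import math


def x_p(t):
    """Combined particular integral from the solution."""
    return -math.cos(2 * t) + math.sin(2 * t) + 5


def residual(x, t, h=1e-4):
    """Approximate 2x'' + 3x' + 2x - (12 cos 2t + 10) using central differences."""
    d1 = (x(t + h) - x(t - h)) / (2 * h)
    d2 = (x(t + h) - 2 * x(t) + x(t - h)) / h**2
    return 2 * d2 + 3 * d1 + 2 * x(t) - (12 * math.cos(2 * t) + 10)


# Largest residual over sample points t = 0, 0.1, ..., 9.9
worst = max(abs(residual(x_p, 0.1 * k)) for k in range(100))
print(worst)
```

A residual that is small compared with the size of the terms in the equation is consistent with $x_p$ being a particular integral; the finite-difference approximation means that it will not be exactly zero.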


Solution to Exercise 20 
(a) The associated homogeneous equation has auxiliary equation

$$\lambda^2 + 4\lambda + 5 = 0,$$

which has solutions

$$\lambda = \frac{-4 \pm \sqrt{16 - 20}}{2} = -2 \pm i.$$

So the complementary function is

$$u_c = e^{-2t}(C\cos t + D\sin t).$$



To find a particular integral, try $u = p_0$. Substituting gives $5p_0 = 5$.
Hence $p_0 = 1$, and a particular integral is

$$u_p = 1.$$

Therefore the general solution is

$$u = e^{-2t}(C\cos t + D\sin t) + 1,$$

where $C$ and $D$ are arbitrary constants.
(b) The associated homogeneous equation has auxiliary equation

$$3\lambda^2 - 2\lambda - 1 = (3\lambda + 1)(\lambda - 1) = 0,$$

which has solutions $\lambda_1 = 1$ and $\lambda_2 = -\tfrac{1}{3}$. So the complementary
function is

$$y_c = Ce^{x} + De^{-x/3}.$$

The right-hand-side function is $e^{2x}$, so to find a particular integral, we
try $y = pe^{2x}$. The derivatives are

$$\frac{dy}{dx} = 2pe^{2x}, \qquad \frac{d^2y}{dx^2} = 4pe^{2x}.$$

Substituting gives

$$3(4pe^{2x}) - 2(2pe^{2x}) - pe^{2x} = 7pe^{2x} = e^{2x}.$$

Hence $p = \tfrac{1}{7}$, and a particular integral is

$$y_p = \tfrac{1}{7}e^{2x}.$$

Therefore the general solution is

$$y = Ce^{x} + De^{-x/3} + \tfrac{1}{7}e^{2x},$$

where $C$ and $D$ are arbitrary constants.
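
The roots of the two auxiliary equations in this solution can be checked numerically. The sketch below assumes Python's standard `cmath` module so that complex roots are handled; the function name `auxiliary_roots` is a hypothetical helper that simply applies the quadratic formula.

```python
import cmath


def auxiliary_roots(a, b, c):
    """Return both roots of a*lam**2 + b*lam + c = 0 as complex numbers."""
    disc = cmath.sqrt(b * b - 4 * a * c)
    return (-b + disc) / (2 * a), (-b - disc) / (2 * a)


# Part (a): lam**2 + 4*lam + 5 = 0, expected to give a complex conjugate pair
roots_a = auxiliary_roots(1, 4, 5)

# Part (b): 3*lam**2 - 2*lam - 1 = 0, expected to give two distinct real roots
roots_b = auxiliary_roots(3, -2, -1)

print(roots_a)
print(roots_b)
```

A complex conjugate pair corresponds to a complementary function made of an exponential multiplied by sines and cosines, as in part (a); two distinct real roots correspond to a sum of two exponentials, as in part (b).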


Solution to Exercise 21 
(a) The derivative of the general solution $u = C\cos 3t + D\sin 3t$ is

$$u' = -3C\sin 3t + 3D\cos 3t.$$

Substituting the initial condition $t = \tfrac{\pi}{2}$, $u = 0$ into the general solution
gives $D = 0$. Substituting the initial condition $t = \tfrac{\pi}{2}$, $u' = 1$ into the
derivative gives $C = \tfrac{1}{3}$. (Remember that $\cos\left(\tfrac{3\pi}{2}\right) = 0$
and $\sin\left(\tfrac{3\pi}{2}\right) = -1$.) Hence the required particular solution is

$$u = \tfrac{1}{3}\cos 3t.$$
(b) The derivative of the general solution $u = e^{-2t}(C\cos t + D\sin t) + 1$ is

$$u' = -2e^{-2t}(C\cos t + D\sin t) + e^{-2t}(-C\sin t + D\cos t) = e^{-2t}\left[(D - 2C)\cos t - (2D + C)\sin t\right].$$

Substituting the initial condition $u = 3$, $t = 0$ into the general solution
gives $C = 2$. Substituting the initial condition $u' = 1$, $t = 0$ into the
derivative gives $D - 2C = 1$, so $D = 5$. So the required particular
solution is

$$u = e^{-2t}(2\cos t + 5\sin t) + 1.$$
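
The particular solution in part (b) can be checked against its initial conditions $u(0) = 3$ and $u'(0) = 1$. The following is a minimal sketch in Python; the function names `u` and `du` are assumptions, `du` encodes the derivative formula quoted in the solution, and the central-difference comparison at $t = 0.7$ is an extra check introduced here.

```python
import math


def u(t, C=2.0, D=5.0):
    """Particular solution u = e^(-2t) (C cos t + D sin t) + 1."""
    return math.exp(-2 * t) * (C * math.cos(t) + D * math.sin(t)) + 1


def du(t, C=2.0, D=5.0):
    """Derivative formula from the solution: e^(-2t) [(D - 2C) cos t - (2D + C) sin t]."""
    return math.exp(-2 * t) * ((D - 2 * C) * math.cos(t) - (2 * D + C) * math.sin(t))


# Initial conditions at t = 0
print(u(0.0), du(0.0))

# Compare the derivative formula with a central-difference estimate at t = 0.7
h = 1e-5
estimate = (u(0.7 + h) - u(0.7 - h)) / (2 * h)
print(abs(estimate - du(0.7)))
```

Agreement between the formula and the finite-difference estimate gives some confidence that the product rule was applied correctly before the constants were fixed.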



Solution to Exercise 22 
(a) The derivative of the general solution $u = C\cos 2x + D\sin 2x$ is

$$u' = -2C\sin 2x + 2D\cos 2x.$$

This part specifies an initial-value problem.

The initial condition $u(0) = 1$ gives $C = 1$. The initial condition
$u'(0) = 0$ gives $D = 0$. The required solution is therefore

$$u = \cos 2x.$$
(b) This is a boundary-value problem.

The boundary condition $u(0) = 0$ gives $C = 0$. The boundary
condition $u\left(\tfrac{\pi}{2}\right) = 0$ gives $C = 0$ also. $D$ therefore remains arbitrary, so
there is an infinite number of solutions, of the form

$$u = D\sin 2x.$$
(c) This is an initial-value problem.

The initial condition $u\left(\tfrac{\pi}{2}\right) = 0$ gives $C = 0$. The initial condition
$u'\left(\tfrac{\pi}{2}\right) = 0$ gives $D = 0$. The required solution is therefore the zero
function

$$u = 0.$$
(d) This is a boundary-value problem.

The boundary condition $u(-\pi) = 1$ gives $C = 1$. The boundary
condition $u\left(\tfrac{\pi}{4}\right) = 2$ gives $D = 2$. The required solution is therefore

$$u = \cos 2x + 2\sin 2x.$$


Solution to Exercise 23 


The differential equation given in this question is the real part of 
equation (58), so its solution is the real part of equation (59). Hence a 
suitable particular integral in this case is 


$$\frac{a_0\left[(\omega_0^2 - \omega^2)\cos(\omega t) + 2\Gamma\omega\sin(\omega t)\right]}{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}.$$

Using equation (31), the amplitude is the square root of the sum of the
squares of the coefficients of $\sin(\omega t)$ and $\cos(\omega t)$, and so is given by

$$A = a_0\left[\frac{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}{\left((\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2\right)^2}\right]^{1/2} = \frac{a_0}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}}.$$

This is the same as equation (61), which is not surprising. Changing the
right-hand side of the equation from $a_0\sin(\omega t)$ to $a_0\cos(\omega t)$ corresponds to
changing the phase constant of the external force (which depends on the
origin chosen for time). It is to be expected that this does not affect the
amplitude of the steady-state forced damped oscillations.



Solution to Exercise 24 


(a) Considering the expression

$$A = \frac{a_0}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\Gamma^2\omega^2}}$$

when $\omega \ll \omega_0$, we can neglect $\omega^2$ in comparison with $\omega_0^2$. We can also
neglect $4\Gamma^2\omega^2$ in comparison with $\omega_0^4$ (because $\Gamma < \omega_0$ for an
underdamped oscillator). We therefore have the approximation

$$A \approx \frac{a_0}{\omega_0^2} \quad \text{for } \omega \ll \omega_0.$$

In this case, the amplitude is a constant that is independent of the
angular frequency of the external force or the damping parameter of
the oscillator.

(b) When $\omega \gg \omega_0$, we can neglect $\omega_0^2$ in comparison with $\omega^2$. In the
underdamped case, $\Gamma < \omega_0$ and we can also neglect $4\Gamma^2\omega^2$ in
comparison with $\omega^4$. We therefore have the approximation

$$A \approx \frac{a_0}{\omega^2} \quad \text{for } \omega \gg \omega_0.$$

In this case, the amplitude decreases steadily as the angular frequency
of the external force increases. The sluggish system is unable to keep
up with the fast oscillations of the external force.

(c) Substituting $\omega = \omega_0$ into the general formula for the amplitude gives

$$A = \frac{a_0}{2\Gamma\omega_0}.$$

This is the amplitude at the peak of resonance. It increases without
limit as the damping parameter $\Gamma$ is decreased.
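
The three limiting cases of the amplitude formula can be compared with the full expression numerically. The following is a minimal sketch in Python; the function name `amplitude` and the sample values of $\omega_0$, $\Gamma$ and $a_0$ are assumptions chosen only to give a lightly damped oscillator, not values from the module.

```python
import math


def amplitude(omega, omega0, gamma, a0=1.0):
    """Steady-state amplitude a0 / sqrt((omega0^2 - omega^2)^2 + 4 gamma^2 omega^2)."""
    return a0 / math.sqrt((omega0**2 - omega**2) ** 2 + 4 * gamma**2 * omega**2)


omega0, gamma, a0 = 10.0, 0.1, 1.0  # illustrative values with gamma < omega0

# Low-frequency limit: the ratio to a0 / omega0**2 should be close to 1
low_ratio = amplitude(0.01, omega0, gamma, a0) / (a0 / omega0**2)

# High-frequency limit: the ratio to a0 / omega**2 should be close to 1
omega_high = 1.0e4
high_ratio = amplitude(omega_high, omega0, gamma, a0) / (a0 / omega_high**2)

# At omega = omega0 the ratio to a0 / (2 * gamma * omega0) should be close to 1
peak_ratio = amplitude(omega0, omega0, gamma, a0) / (a0 / (2 * gamma * omega0))

print(low_ratio, high_ratio, peak_ratio)
```

Reducing `gamma` while keeping `omega0` fixed makes the value at $\omega = \omega_0$ grow in proportion to $1/\Gamma$, in line with the final remark of the solution.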


Acknowledgements 



Grateful acknowledgement is made to the following sources: 

Figure 13: Taken from 

http://en.wikipedia.org/wiki/File:Sony-walkman-srfs84s_0001.JPG. This
file is licensed under the Creative Commons Attribution-Share Alike 3.0 
Unported licence. 


Figure 14: Ericl Liu. This file is licensed under the Creative Commons 
Attribution-Share Alike 3.0 Unported licence. 


Figure 15: Andrew P. Harmsworth, Science, Space, Education & IT Skills. 
www.harmsy.freeuk.com/fraunhofer.html. 


Every effort has been made to contact copyright holders. If any have been 
inadvertently overlooked, the publishers will be pleased to make the 
necessary arrangements at the first opportunity. 



Index 


absorption coefficient 88 
amplitude 174 
analytic solution 102 
angular frequency 
of a driving force 199 
of a simple harmonic oscillator 174 
of an underdamped harmonic oscillator 175 
arbitrary constant 51, 152, 155 
arccos 26, 27 
arcsin 27 
arctan 27 
Argand diagram 34 
argument of a complex number 34 
associated homogeneous equation 179 
asymptote 48 
auxiliary equation 164 


base 16 

Bernoulli, Jacob 21 
boundary condition 194 
boundary value 194 
boundary-value problem 194 


carbon-14 90

chain rule 44 

characteristic equation 164 
coincident roots 30 
complementary function 179 
complex conjugate 31 
complex exponential 35 
complex number 30 
complex-valued function 42 
composite rule 44 
composition of functions 18 
constant of integration 51 
cosecant 25 

cosine 24 

cotangent 25 

critically damped motion 176 


damped harmonic oscillator 153, 172 

de Moivre, Abraham 36 

de Moivre’s theorem 36 

decay constant 90, 175 

definite integral 59 

dependent variable 6, 96, 104 

derivative 38, 39, 96 

derived function 38 

difference of two squares 15 

differentiable function 98 

differential equation 92, 97 
explicit general solution 108 



first-order 92, 97 
general solution 98 
homogeneous 111, 158 
implicit general solution 108 
inhomogeneous 111, 158 
linear 111, 158 
linear constant-coefficient 158 
non-homogeneous 111, 158 
order 97 
particular solution 98 
separable 105 
solution 97 
differentiation 37, 38 
of a complex-valued function 42 
direct integration 103 
direction field 121 
discriminant 14 
division of complex numbers 31, 36 
domain of a function 6, 53, 101 
driving force 199 


equal roots 30 

equation of motion 
of a damped harmonic oscillator 153 
of a forced damped harmonic oscillator 153 
of a simple harmonic oscillator 152 

Euler, Leonhard 21 

Euler’s formula 34 

Euler’s method 128 

Euler’s number 21 

even function 62 

explicit general solution 108 

exponent 16 

exponential behaviour 185 

exponential decay 88, 94 

exponential dependence 24 

exponential form of a complex number 35 

exponential function 19 

exponential growth 94 


factorisation 14 

Feynman, Richard 35 

first derivative 41 

first derivative test 48 

first-order differential equation 92, 97 
forced damped harmonic oscillator 153, 199 
frequency 174 

function 6 

function notation for derivatives 40 


Gaussian elimination 10 
general solution of a differential equation 98, 103, 108, 
115, 156 


homogeneous linear constant-coefficient 170
inhomogeneous 180, 181
global maximum 48
global minimum 48
gradient 9, 39

half-life 90
homogeneous differential equation 111, 158
general solution 170

image set of a function 6
imaginary part of a complex number 30
implicit differentiation 45
implicit general solution 108
indefinite integral 51, 103
independent variable 6, 96, 104
index 16
inhomogeneous differential equation 111, 158, 178
general solution 181
initial condition 100, 192
initial value 100, 192
initial-value problem 100, 192
integral 52
of an odd function 63
integrand 51
integrating factor 115
integrating factor method 115
integration 50
by change of variable 56
by parts 58
by substitution 56
inverse function 7
inverse trigonometric functions 26

Leibniz, Gottfried Wilhelm 40
Leibniz notation for derivatives 40
limits of integration 60
linear constant-coefficient differential equation 112, 158
linear differential equation
first-order 111
second-order 158
linear function 9
linearly dependent functions 168
linearly independent functions 168
local maximum 47
local minimum 47
logarithm 22
logistic equation 94, 110, 121
lower limit of integration 60

maximum of a function 47
method of undetermined coefficients 182, 187
minimum of a function 47
mode of oscillation 197
modulus of a complex number 32, 34
multiplication of complex numbers 36

natural angular frequency 174
natural logarithm 22
Newton, Isaac 41
Newtonian notation for derivatives 41
Newton's second law 95, 151
non-homogeneous differential equation 111, 158
nth derivative 41
nth root 16
nth-order polynomial 30
numerical analysis 124

odd function
order of a derivative 41
order of a differential equation 97
overdamped motion 175

parabola 12
parameter 7
partial differential equation 197
particular integral 179, 187
particular solution of a differential equation 98, 156
perfect square 14
period of an oscillation 174
periodic function 25
phase constant 174
point of inflection 47
polar coordinates 33
polar form of a complex number 34
polynomial of degree n 30
population model 92
power 16
power function 24
principal value of the argument 34
principle of superposition 159, 160
procedure
direct integration 103
Euler's method 128
general solution of a homogeneous equation 170
general solution of an inhomogeneous equation 181
integrating factor method 115
particular solution of an inhomogeneous equation 187
separation of variables 108
to find local maxima and minima 47, 48
to solve a quadratic equation 13
product rule 43
proportionate birth rate 93
proportionate death rate 93
proportionate growth rate 93

quadratic equation 12
formula method for solving 13
quadratic function 12
quantisation of energy 197
quantum corral 4
quotient rule 43

radioactive decay 89
radiocarbon dating 90
rate of change 38
real part of a complex number 30
repeated roots 30
resonance 201
resonant angular frequency 201
restricted values of a function 27
roots
of a polynomial 30
of a quadratic equation 13
sum of 13

scanning tunnelling microscope 4
secant 25
second derivative 41
second derivative test 47
separable differential equation 105, 108
separation of variables method 108
simple harmonic oscillator 151
simultaneous linear equations 10
sine 24
sinusoidal function 186
slope 9
solution of a differential equation 97
solution of a quadratic equation 13
stationary point 47
step length 127
step size 127

tangent 25
Taylor series
for the cosine function 29
for the exponential function 19
for the sine function 29
third derivative 41
trial solution 87, 163, 182
trigonometric functions 24
trigonometric identities 28

underdamped motion 174
undetermined coefficients, method of 187
units
incorporated into symbols 8
kept separate from symbols 8
upper limit of integration 60

variable 6
Voyager spacecraft 95

Wigner, Eugene 4


BOOK 1 Differential equations 


Unit 1 Getting started
Unit 2 First-order differential equations
Unit 3 Second-order differential equations


ISBN 978-1-7800-7479-5 


Cover image © Andrija Kova/istockphoto


